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Abstract 

This paper presents a model and implementation techniques for speed- 
ing up constraint propagation. Three fundamental approaches to improv- 
ing constraint propagation based on propagators as implementations of 
constraints are explored: keeping track of which propagators are at fix- 
point, choosing which propagator to apply next, and how to combine 
several propagators for the same constraint. 

We show how idempotence reasoning and events help track fixpoints 
more accurately. We improve these methods by using them dynamically 
(taking into account current domains to improve accuracy). We define 
priority-based approaches to choosing a next propagator and show that 
dynamic priorities can improve propagation. We illustrate that the use of 
multiple propagators for the same constraint can be advantageous with 
priorities, and introduce staged propagators that combine the effects of 
multiple propagators with priorities for greater efficiency. 

1 Introduction 

We consider the problem of solving Constraint Satisfaction Problems (CSPs) 
defined in the sense of Mackworth [21], which can be stated briefly as follows: 

We are given a set of variables, a domain of possible values for each 
variable, and a set (read as conjunction) of constraints. Each con- 
straint is a relation defined over a subset of the variables, limiting 
the combination of values that the variables in this subset can take. 
The goal is to find a consistent assignment of values to the variables 
so that all the constraints are satisfied simultaneously. 
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One widely-adopted approach to solving CSPs combines backtracking tree search 
with constraint propagation. This framework is realized in finite domain con- 
straint programming systems, such as SICStus Prolog [IS], ILOG Solver [T?] . 
and Gecode [T2] that have been successfully applied to many real-life industrial 
applications. 

At the core of a finite domain constraint programming system is a constraint 
propagation engine that repeatedly executes propagators for the constraints of a 
problem. Propagators discover and remove values from the domains of variables 
that can no longer take part in a solution of the constraints. 

Example 1.1 Consider a simple CSP, with variables x\, X2, and x 3 whose 
domain of possible values are respectively X\ £ {2,3,4}, X2 £ {0,1,2,3}, 23 £ 
{ — 1, 0, 1, 2} and the constraints are x 3 = £2, x\ < £2 + 1, and X\ 7^ 3. 

A propagator for x 3 = x-i can determine that x-i ^ 3 in any solution of this 
constraint since £3 cannot take the value 3. Similarly x 3 ^ — 1. The propagator 
then reduces the domains of the variables to X2 £ {0, 1, 2} and x 3 £ {0, 1, 2}. A 
propagator for x\ < X2 + 1 can determine that x\ ^ 4 since X2 > 3, and X2 ^ 
since x\ < 1 so the domains are reduced to x\ £ {2,3} and x 2 £ {1,2}. The 
propagator for xi ^ 3 can remove the value 3 from the domain of x%, leaving 
x\ £ {2} (or x\ =2). If we now reconsider the propagator for x 3 = X2 we can 
reduce domains to X2 £ {1,2} and X3 £ {1,2}. No propagator can remove any 
further values. 

We have not solved the problem, since we do not know a value for each vari- 
able. So after propagation we apply search, usually by splitting the domain of a 
variable into two disjoint subsets and considering the resulting two subproblems. 

Suppose we split the domain of x 2 - One subproblem has X\ £ {2}, x 2 £ {1} 
and x 3 £ {1,2}. Applying the propagator for x 2 = x 3 results in x 3 £ {2}. Since 
each variable now takes a fixed value we can check that x\ = 2, X2 = 1, £3 = 1 
is a solution to the CSP. The other subproblem has x\ £ {2}, X2 £ {2} and 
£3 £ {1,2}, and leads to another solution. □ 

As can be seen from the example finite domain constraint programming 
interleaves propagation with search. In this paper we investigate how to make 
a propagation engine as efficient as possible. 

There are two important decisions the engine must make: which propagators 
should execute, and in which order they should execute. In order to make 
constraint propagation efficient, it is clear that the engine needs to take the 
following issues into account: avoid unnecessary propagator execution, restrict 
propagation to relevant variables, and choose the cheapest possible method for 
propagation. In this paper we show how propagation can be speeded up if the 
engine takes these issues into account. 

The contributions of the paper are as follows: 

• We give a formal definition of propagation systems including fixpoint and 
event-based optimizations used in current propagation systems. 

• We extend event-based propagation systems to use dynamically changing 
event sets. 
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• We introduce multiple propagators and staged propagators for a single 
constraint for use with propagation queues with priority. 

• We give experimental results that clarify the impact of many choices in im- 
plementing propagation engines: including idempotence reasoning, static 
and dynamic events, basic queuing strategies, priority queues, and staged 
propagation. 

Plan of the paper The next section introduces propagation-based constraint 
solving, followed by a model for constraint propagation systems in Section [3J 
Section [4] presents how to optimize propagation by taking idempotence into ac- 
count, while Section O explores the use of event sets. Which propagator should 
be executed next is discussed in Section [BJ while combination strategies of mul- 
tiple propagators for the same constraint is discussed in Section [7J Experiments 
for each feature are included in the relevant section, and a summary is given in 
Section [8] Section [9] concludes. 

2 Propagation-based Constraint Solving 

This section defines our terminology for the basic components of a constraint 
propagation engine. In this paper we restrict ourselves to finite domain integer 
constraint solving. Almost all the discussion applies to other forms of finite 
domain constraint solving such as for sets and multisets. 

Domains A domain D is a complete mapping from a fixed (finite) set of 
variables V to finite sets of integers. A false domain D is a domain with D{x) = 
for some x £ V. A variable x £ V is fixed by a domain D, if |.D(x)| = 1. The 
intersection of domains D\ and D2, denoted D\ IID2, is defined by the domain 
D(x) = Di(x) n D 2 {x) for all 16V. 

A domain D\ is stronger than a domain D2, written D\ C D 2 , if D\(x) C 
D2(x) for all x £ V. A domain D\ is stronger than (equal to) a domain D2 
w.r.t. variables V, denoted D\ Cy D 2 (resp. D\ =y D 2 ), if D\(x) C D 2 (x) 
(resp. Di(x) = D 2 (x)) for all x £ V. 

A range is a contiguous set of integers, we use range notation [I .. u] to denote 
the range {d £ Z, \ I < d < u} when / and u are integers. A domain is a range 
domain if D(x) is a range for all x. Let D' = range(D) be the smallest range 
domain containing D, that is, the unique domain D'(x) = [vol D(x) .. supZ?(.T)] 
for all x £ V. 

We shall be interested in the notion of an initial domain, which we denote 
Z?i n j t . The initial domain gives the initial values possible for each variable. It 
allows us to restrict attention to domains D such that D C -Dimt- 

Valuations and constraints An integer valuation 9 is a mapping of variables 
to integer values, written {x\ 1— > d\, ... ,x n 1— > d n }. We extend the valuation 9 
to map expressions and constraints involving the variables in the natural way. 
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Let vars be the function that returns the set of variables appearing in a 
valuation. We define a valuation 9 to be an element of a domain D, written 
9 G D, if 9(xi) G D{xi) for all x t G vars(0). 

The inftmum and supremum of an expression e with respect to a domain D 
are defined as inf^ e = inf {9(e)\9 G D} and sup D e = sup {9(e)\9 G D}. 

We can map a valuation 9 to a domain Dg as follows 

D 9 (x) = { ^vars(tf) 
\ -Dinit(2;) otherwise 

A constraint c over variables x\, . . . ,x n is a set of valuations 9 such that 
vars(#) = {x\, . . . ,x n }. We also define vars(c) = {x\, . . . ,x n }. 



Propagators We will implement a constraint c by a set of propagators prop(c) 
that map domains to domains. A propagator f is a monotonically decreasing 
function from domains to domains: f(D) C D, and f{D{) C f(D 2 ) whenever 
Di C D2. A propagator / is correct for a constraint c iff for all domains D 

{9\9 ED}C\c={9\9 G /(£>)} He 

This is a very weak restriction, for example the identity propagator is correct 
for all constraints c. 

A set of propagators F is checking for a constraint c, if for all valuations 9 
where vars(#) = vars(c) the following holds: /(-De) = Dg for all / G F, iff 9 G c. 
That is, for any domain D# corresponding to a valuation on vars(c), f(Dg) is a 
fixpoint iff 9 is a solution of c. We assume that prop(c) is a set of propagators 
that is correct and checking for c. 

The output variables output (/) C V of a propagator / are the variables 
changed by the propagator: x G output (/) if there exists a domain D such that 
f(D)(x) ^ D(x). The input variables input (/) C V of a propagator / is the 
smallest subset V C V such that for each domain D: D —y D' implies that 
D'nf(D) =output(/) f(D')nD. Only the input variables are useful in computing 
the application of the propagator to the domain. 

Example 2.1 [Propagators, input, and output] For the constraint c = 
x\ < x 2 + 1 the function Ja defined by /a(D) (xi) — {d G D{x\) \ d < sup D x 2 + 
1} and Ja(D)(v) = D(v),v ^ x\ is a correct propagator for c. Its output 
variables are {x\\ and its input variables are {x 2 }- Let Di(x\) = {1,5,8} and 
D1OE2) = {1,5}, then /(Di) = D 2 where D 2 {x 1 ) = D 2 (x 2 ) = {1,5}. 

The propagator fs defined as fB{D){x 2 ) = {d G D(x 2 ) \ d > info a; 1 — 1} 
and /b(D)(u) = D(v),v ^ x 2 is another correct propagator for c. Its output 
variables are {x 2 } and input variables {xi}. 

The set {/a,/b} is checking for c. The domain Dg x (xi) = Dg 1 (x 2 ) = {2} 
corresponding to the solution 9\ = {x\ 2, £2 |—> 2} of c is a fixpoint of 
both propagators. The non-solution domain Dq 2 (x\) = {2}, Dg 2 (x 2 ) = {0} 
corresponding to the valuation 9 2 = {x\ ^ 2,x 2 0} is not a fixpoint (of 
either propagator). □ 
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A propagation solver so\v(F, D) for a set of propagators F and an initial 
domain D finds the greatest mutual fixpoint of all the propagators / £ F. In 
other words, solv(F, D) returns a new domain defined by 

so\v(F,D) =gfp(Ad.iter(F,d))(£>) iter(F,£>)= l~l /(£>) 

where gfp denotes the greatest fixpoint w.r.t C lifted to functions. 

Note that by inverting the direction of C we could equally well phrase this 
as a least fix point (as in pQ). But the current presentation emphasizes the 
reduction of domains as computation progresses. 



Domain and bounds propagators A consistency notion C gives a condition 
on domains with respect to constraints. A set of propagators F maintains 
C '-consistency for a constraint c, if solv(F, D) is always C consistent for c. 
Many propagators in practice are designed to maintain some form of consistency: 
usually domain or bounds. But note that many more do not. 

The most successful consistency technique was arc consistency [ST], which 
ensured that for each binary constraint, every value in the domain of the first 
variable, has a supporting value in the domain of the second variable that sat- 
isfied the constraint. Arc consistency can be naturally extended to constraints 
of more than two variables. This extension has been called generalized arc con- 
sistency |24j , as well as domain consistency [331 134] (which is the terminology 
we will use), and hyper-arc consistency [22] . A domain D is domain consistent 
for a constraint c if D is the least domain containing all solutions 9 € D of c, 
that is, there does not exist D' C D such that 6<eDA6<Ec^9<eD'. 

Define the domain propagator dom(c), for a constraint c as 

dom(c)(-D)(x) = {0{x) \ 6eD/\6ec} where x 6 vars(c) 
dom(c)(-D)(x) = D(x) otherwise 

The basis of bounds consistency is to relax the consistency requirement to 
apply only to the lower and upper bounds of the domain of each variable x. 
There are a number of different notions of bounds consistency [5] , we give the 
two most common here. 

A domain D is bounds (Z) consistent for a constraint c with vars(c) = 
{xi, . . . , x n }, if for each variable Xi, 1 < i < n and for each d, £ {inf r> x%, sup D Xi} 
there exist integers dj with inf/) Xj < dj < sup D Xj,\<j<n,j^i such that 
9 = {xi i— > di , . . . , x n I— > d n } is an integer solution of c. 

A domain D is bounds (R) consistent for a constraint c with vars(c) = 
{xi, . . . , x n }, if for each variable Xi, 1 < i < n and for each di G {inf £> a?i, sup^ x{\ 
there exist real numbers dj with inf^ Xj < dj < sup D Xj, 1 < j < n,j ^ i such 
that 6 = {xi i— y d\ , . . . , x n i— > <i n } is a real solution of c. 

A set of propagators -F maintains bounds (a) consistency for a constraint c, 
if for all domains D, solv(F, D) is bounds(a) consistent for c. 

We can define a bounds(Z) propagator, zbnd(c) for a constraint c as follows: 

zbnd(c)(D) = D l~l range(dom(c)(range(D))) 
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search(F ,.F„,L>) 

D := \so\v(F ,F n ,D) 
if (D is a false domain) 

return false 
if (3x G V.|D(«)| > 1) 

choose {ci , . . . , c m } where C h D (= ci V • • • V c, 
for i G [1 .. m] 

if (sea rc h (T UF„, prop (c; ),£>)) 
return frae 
return /afee 
return tree 

Figure 1: Search procedure 

It is not straightforward to give a generic description of bounds (R) propaga- 
tors, rbnd(c), for a constraint c, that just maintains bounds(R) consistency. 
Examples 13.21 and 14.31 define three such propagators. 

3 Constraint Propagation Systems 

A constraint propagation system evaluates the function solv(i ? , D) during back- 
tracking search. We assume an execution model for solving a constraint problem 
with a set of constraints C and an initial domain D as follows. We execute 
the procedure search (0, F, Dq) given in Figure [T] for an initial set of propagators 
F = Ucgc P r °p(c). This procedure is used to make precise the optimizations 
presented in the remainder of the paper. 

Note that the propagators are partitioned into two sets, the old propaga- 
tors F and the new propagators F n . The incremental propagation solver 
isolv(F , F n , D) (to be presented later) takes advantage of the fact that D is 
guaranteed to be a fixpoint of the old propagators. 

The somewhat unusual definition of search is quite general. The default 
search strategy for many problems is to choose a variable x such that |-D(a;)| > 1 
and explore x = inf^a; or x > inf u x + 1. This is commonly thought of as 
changing the domain D for x to either {inf^ x} or {d G D(x) \ d > inf^x}. 
This framework allows more general strategies, for example x\ < xi or x\ > x%. 

The basic incremental propagation solver algorithm is given in Figure[2j The 
algorithm uses a queue Q of propagators to apply. Initially, Q contains the new 
propagators. Each time the while loop is executed, a propagator / is deleted 
from the queue, / is applied, and then all propagators that may no longer be 
at a fixpoint at the new domain D' are added to the queue. An invariant of the 
algorithm is that at the while statement f(D) = D for all / G F — Q. 

The propagation solver isolv leaves two components undefined: choose(Q) 
chooses the propagator / G Q to be applied next; new(f,F,D,D') determines 
the set of propagators /' G F that are not guaranteed to be at their fixpoint at 



% propagation 



% search strategy 



G 



\so\v{F ,F n ,D) 

F := F a U F n ; Q := F n 
while (Q ^ 0) 

f := choose(Q) % select next propagator to apply 

Q := Q -{/};£>':= /(£>) 

Q := Q U new(/, F, D, £)') % add propagators f eF ... 
D := D' % . . . not necessarily at fixpoint at D' 

return D 



Figure 2: Incremental propagation solver. 



the domain D 1 . The remainder of the paper investigates how to best implement 
these two components. 

3.1 Basic Variable Directed Propagation 

The core aim of the constraint propagation solver solv(.F, D) is to find a domain 
that is a mutual fixpoint of all / G F. The incremental solver \so\\/(F , F n , D) 
already takes into account that initially D is a fixpoint of propagators / € F . 
The role of new is (generally) to return as few propagators / 6 F as possible. 
A basic definition of new is as follows 

new input (/, F, D, D') = {/' e F | input(/') n {x e V | D(x) ^ D'(x)} ^ 0} 

Here all propagators /' are added whose input variable domains have changed. 
By the definition of input variables, if none of them have changed for /', then 
f'(D') = D' since /'(£>) = D if /' € F - Q. 

Proposition 3.1 neWi nput maintains the invariant f'(D) = D for all f £ 
F — Q at the start of the while loop. 

Proof: Consider f e F - Q. Then /'(£>) = D and if D = input (/') D' we have 
that D> □ f'(D) = output(/0 f'(D') n D. Then 

D' = D'HD since D'Cfl 

£>' n /'(£») since D = f'(D) 

=out P ut(/') Z'(-D') nfl by definition of input (/') 

f'(D') since f{D') Cfl'Cfl 

Now D' =out P ut(/') /'(■£>') implies £>' = /'(£)') by the definition of output (/')• 
Hence each /' in F — Q is at fixpoint at the start of the loop. □ 

The incremental propagation solver isolv with this definition of new (assum- 
ing F Q = 0) is more or less equivalent to the propagation algorithms in [4] and [H 
page 267]. 
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Example 3.2 [Incremental propagation] Consider the problem with con- 
straints cc = x\ = 2x 2 and cd = x\ — 3x 3 represented by the bounds(K) 
propagators 

fc(D){x 1 ) = £>0i) D [2inf D x 2 .. 2sup D x 2 ] , 
f c (D)(x 2 ) = D(x 2 )n [[|infuori] .. L| sup D xj] , 
fc(D){x) = D(x) ' " x<£{x u x 2 } 

/d{D)(xi) = D(xi) n[3M D x 3 .. 3sup D x 3 ] , 
/d(D)(x 3 ) = £>(z 3 )n [[iinf^xi] .. |_§sup D a;iJ] , 
f D {D)(x) = D{x) x <£ {x 1 ,x 3 }, 

with initial domains D{x 1 ) = [0 .. 17], D(x 2 ) = [0 .. 9], and D(x 3 ) = [0 .. 6]. 
Initially no constraint is at fixpoint, so Q = {fc, Id}- fc is selected initially, 
and we execute the bounds propagator determining D(x\) = [0 .. 16], D(x 2 ) = 
[0 .. 8]. Since x\ has changed, both fc and fr> arc added to the queue. Then fv 
is executed setting D(x\) = [0 .. 15], D(x 3 ) = [0 .. 5]. Again both constraints are 
re-queued, fc is executed changing the domains to D(xi) = [0 .. 14], D(x 2 ) = 
[0 .. 7]. Then f D changes the domains to D(xi) = [0 .. 12], D(x 3 ) = [0 .. 4]. 
Since x\ has changed we have Q — {fc, /d}- Now fc is executed for no change, 
and fo is executed for no change. We have reached a fixpoint D{x\) = [0 .. 12], 
D(x 2 ) = [0 .. 6], and D(x 3 ) = [0 .. 4]. ' □ 

4 Fixpoint Reasoning 

The propagation engine computes a mutual fixpoint of all the propagators. 
Clearly, if we can determine that some propagators are at fixpoint without 
executing them we can limit the amount of work required by the engine. 

4.1 Static Fixpoint Reasoning 

A propagator / is idempotent if f{D) = /(/(£))) for all domains D. That is, 
applying / to any domain D yields a fixpoint of /. 

Example 4.1 [Idempotent propagator] The propagator defined by 

f E (D)( Xl ) = {d e D(xi) | |d e D(x 2 )} 
f E (D)(x 2 ) = {d G D(x 2 ) | \d g D(xi)} 
f E (D)(x) = D(x) xg{x u x 2 } 

is the domain propagator for the constraint 3xi = 2x 2 . The propagator f E is 
idempotent. □ 

It is not difficult to see that each domain propagator dom(c) is idempotent. 

Proposition 4.2 For all constraints c and domains D 

dom(c)(L>) = dom(c)(dom(c)(L>)) 
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Proof: Consider 8 G c where 8 € D. Then by definition 8(x) G dom(c)(D) for all 
x G vars(c). Hence 8 G dom(c)(D). Since dom(c)(D) Cflwe have 8 £ cA8 <E D 
iff 6> G cA6» G dom(c)(L>). Hence dom(c)(D) = dom(c)(dom(c)(D)). □ 

Example 4.3 [Non-idempotent propagators] While many propagators are 
idempotent, some widely used ones are not idempotent. Consider the constraint 
3a;i = 2x2 and the propagator f F : 



[|infi)X2l .. |_| suPd x 2 \ 
\\'mi D x{\ .. [fsup^xij 



f F {D){x x ) = D( Xl )n 
f F (D)(x 2 ) = D{x 2 )n 
f F {D)(x) = D(x) x<?{xi,x 2 } 



In almost all constraint programming systems prop(3xi = 2x2) is {f F } where 
f F is the bounds(R) propagator for 3xi = 2x2- Now f F is not idempotent. 
Consider D(x x ) = [0 .. 3] and D(x 2 ) = [0 .. 5]. Then D' = f F (D) is defined 
by L>'(xi) = [0 .. 3] n [0 .. [10/3J] = [0 .. 3] and D'(x 2 ) = [0 .. 5] n [0 .. L9/2J] = 
[0 .. 4]. Now D" = f F (D') is defined by D"( Xl ) = [0 .. 3] n [0 .. L8/3J] = [0 .. 2] 
and D"(x 2 ) = [0 .. 4] n [0 .. L9/2J] = [0 .. 4]. Hence f F (f F (D)) = D" ^ D' = 
f F {D). □ 

We can always create an idempotent propagator /' from a propagator / by 
defining f'(D) — solv({/}, D). Indeed, in some implementations (for exam- 
ple [16]) prop(3xi = 2x2) is defined as the fixpoint of applying f F . 

Assume that idem(/) = {/} if / is an idempotent propagator and idem(/) = 
otherwise. The definition of new is improved by taking idempotence into 
account 

new sfix (/, F, D, D') = new input (/, F, D, D') - idem(/) 

An idempotent propagator is never put into the queue after application. 

Note that without the idempotence optimization each propagator / that 
changes the domain is likely to be executed again to check it is at fixpoint. Al- 
most all constraint propagation solvers take into account static fixpoint reason- 
ing (for example ILOG Solver [T7], Choco [T5], SICStus [H], and Gecode [12]). 
Some systems even only allow idempotent propagators (for example Mozart [26]). 



4.2 Dynamic Fixpoint Reasoning 

Even if a propagator is not idempotent we can often determine that f{D) is 
a fixpoint of / for a specific domain D. For simplicity we assume a function 
fix(/, D) that returns {/} if it can show that f(D) is a fixpoint for / and 
otherwise (of course without calculating /(/(D)), otherwise we gain nothing). 
In practice this will be included in the implementation of /. 

new dfix (/, F, D, D') = new input (/, F, D, D') - fix(/, D) 

Example 4.4 [Dynamic idempotence] For bounds propagation for linear 
equations on range domains we are guaranteed that the propagator is at a fix- 
point if there is no rounding required in determining new endpoints 16 , Theorem 
8]. 
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Wc can define fix(fp,D) for the bounds propagator fp from Example 14.31 
for the constraint 3xi = 2x2, as returning {/f} if any new bound does not 
require rounding, e.g. 2infijX2/3 = [2 inf^ x 2 /3] or 2 info 3:2/3 < inf^xi and 
similarly for the other three bounds. 

Consider applying fp to the domain D" from the same example. Now D'" = 
fp (D") is defined by D"'( Xl ) = [0 .. 2] n [0 .. L8/3J] = [0 .. 2] and D'"(x 2 ) = 
[0 .. 4] n [0 .. L6/2J] = [0 .. 3]. Notice that the new bound x 2 < 3 is obtained 
without rounding [6/2J = [3J =3. In this case we are guaranteed that the 
propagator is at a fixpoint. □ 

Note that the dynamic case extends the static case since for idempotent / it 
holds that fix(/, D) = {/} for all domains D. The dynamic fixpoint reasoning 
extensions are obviously correct, given Proposition [3TTJ 

Proposition 4.5 newdfix maintains the invariant f(D) — D for all f G F — Q 
at the start of the while loop. 

A complexity of either form of fixpoint reasoning is that a great deal of 
care has to be taken when we claim a propagator is at fixpoint, particularly for 
bounds propagators and for propagators computing with multiple occurrences 
of the same variable. 

Example 4.6 [Falling into domain holes] Consider a bounds propagator for 
xi — X2 + 1 defined as 

f G (D)(x 1 ) = D(xi) n [inf D x 2 + 1 .. sup D x 2 + 1] 
f G (D)(x 2 ) = D(x 2 ) n [info Xi - 1 .. sup D xi - 1] 
f G (D)(x) = D(x) x£{x u x 2 } 

We would expect this propagator to be idempotent since there is no rounding 
required. Consider the application of fc to the domain D(x\) = {0,4,5,6} 
and D(x 2 ) = {2,3,4,5}. Then f G (D) = D' where D'{x x ) = {4,5,6} and 
D'(x 2 ) = {2,3,4,5}. This is not a fixpoint for f G because of the hole (that is 
1, 2,4 ^ D(xi)) in the original domain of X\. □ 

Example 4.7 [Multiple variable occurrences] The regular constraint in- 
troduced in [27] constrains a sequence of variables to take values described by 
a regular expression (or a corresponding finite automaton) . A common case for 
the regular constraint is to express cyclic patterns by performing propagation 
on a sequence of variables where some variables appear multiply. 

Assume a propagator fu propagating that the sequence of variables (x\ ,x 2 ,xs) 
conforms to the regular expression (1 1 100)0 (that is, the values of three vari- 
ables form either the string 110 or 000). 

For a domain D with D(x\) — D(x 2 ) — D(xz) ~ {0,1} propagation for 
the sequence {x\ 1 x 2 , x^) is obtained by checking which values are still possible 
for each variable by traversing the sequence once. In this particular case, for 
D' = f H {D) we have that D'{x 3 ) = {0} and D'(xi) = D'{x 2 ) = {0, 1} and D' 
is a fixpoint for . 
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Now assume a sequence (xi, X2, X\) where x\ appears twice. Using the same 
strategy as above, a single forward traversal of the sequence yields: D' = 
where D'{x\) = {0} and D'{x2) — {0, 1}. However, D' is not a fixpoint for fn- 
□ 

The two examples above describe common cases where a propagator is not 
idempotent but in many cases still computes a fixpoint. In these cases, dynamic 
fixpoint reasoning is beneficial: static fixpoint reasoning would force these prop- 
agators to be never considered to be at fixpoint while dynamic fixpoint reasoning 
allows the propagator to decide whether the propagator is at fixpoint for a given 
domain or not. 

In practice dynamic fixpoint reasoning completely subsumes static fixpoint 
reasoning and is easy to implement. To implement dynamic fixpoint reasoning 
a propagator is extended to not only return the new domain but also a flag 
indicating whether it is guaranteed to be at fixpoint. 

4.3 Fixpoint Reasoning Experiments 

Table [1] shows runtime (walltime) and the number of propagation steps for 
the three different variants of fixpoint reasoning considered. The first column 
presents absolute runtime values in milliseconds and the number of propagation 
steps when no fixpoint reasoning is considered. The two remaining columns 
"static" and "dynamic" show the relative change to runtime and propagation 
steps when using static and dynamic fixpoint reasoning. For example, a relative 
change of +50% means that the system takes 50% more time or propagation 
steps, whereas a value of —50% means that the system takes only half the time or 
propagation steps. The row "average (all)" gives the average (geometric mean) 
of the relative values for all examples. More information on the examples can 
be found in Appendix and on the used platform in Appendix [Bj 

Note that both static and dynamic fixpoint reasoning do not change the 
memory requirements for any of the examples. 

Static fixpoint reasoning While static fixpoint reasoning reduces the num- 
ber of propagator executions by 8.9% in average, the reduction in runtime is 
modest by 1.6% in average. The reason that the reduction in propagation steps 
does not directly translate to a similar reduction in runtime is that the avoided 
steps tend to be cheap: after all, these are steps not performing any propagation 
as the propagator is already at fixpoint. Examples with significant reduction 
in runtime (such as donald-d, minsort-200, picture, and square-5-d) profit 
because the execution of costly propagators is avoided (domain-consistent lin- 
ear equations for donald-d and square-5-d; regular propagators for picture; 
minimum propagators involving up to 200 variables for minsort). The behavior 
of golomb-10-b and golomb-10-d is explained further below. 

Dynamic fixpoint reasoning As expected, both runtime as well as propa- 
gation steps are considerably smaller for dynamic fixpoint reasoning compared 
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Table 1: Fixpoint reasoning experiments. 



Example 


none 


static 


dynamic 




time (ms) 


steps 


time 


steps 


time 


steps 


all-interval-500 


118.31 


503 036 


-3.1% 


-24.8% 


-38.9% 


-24.9% 


alpha 


106.56 


272 398 


-2.0% 


-5.7% 


-1.5% 


-5.7% 


bibd-7-3-60 


2 279.36 


1335 657 


-0.2% 


-6.5% 


-0.2% 


-6.5% 


cars 


4.64 


15 641 


-0.4% 


±0.0% 


±0.1% 


±0.0% 


crowded-chess-7 


624.25 


806 664 


-0.1% 


±0.0% 


-0.2% 


±0.0% 


donald-b 


0.70 


546 


-2.5% 


-8.4% 


-10.4% 


-16.5% 


donald-d 


30.37 


46 


-6.3% 


-6.5% 


-6.4% 


-13.0% 


donald-v 


0.38 


546 


-5.9% 


-16.5% 


-5.1% 


-16.5% 


golomb-10-b 


1 347.48 


2 642 464 


+4.7% 


-17.9% 


±0.1% 


-17.5% 


golomb-10-d 


2 430.00 


2 642 962 


+4.5% 


-18.0% 


±4.3% 


-18.0% 


graph-color 


35.87 


9 344 


-0.1% 


±0.0% 


-7.2% 


-6.2% 


grocery 


55.41 


2 299 


+2.6% 


-3.8% 


±2.6% 


-3.8% 


knights-10 


7.46 


48 038 


+0.9% 


±2.2% 


±0.9% 


±2.2% 


mlnsort-200 


342.48 


240 991 


-11.3% 


-8.3% 


-5.1% 


-16.9% 


o-latin-7-d 


574.36 


387 060 


±0.0% 


-0.5% 


-8.7% 


-18.2% 


partition-32 


8 571.24 


16 785128 


-3.7% 


-18.8% 


-9.2% 


-19.7% 


photo 


108.87 


422 206 


-0.3% 


-0.8% 


-14.6% 


-3.6% 


picture 


1 553.42 


150 165 


-4.1% 


-13.1% 


-3.4% 


-20.5% 


queens-400 


4 433.12 


31424152 


+0.2% 


±0.0% 


±0.0% 


±0.0% 


queens-400-a 


16.15 


2 469 


-1.2% 


-16.2% 


-1.2% 


-16.2% 


sequence-500 


517.96 


151609 


-0.1% 


-0.1% 


-0.1% 


-0.1% 


square-5-d 


33 391.24 


1 762 492 


-6.2% 


-16.0% 


-6.1% 


-16.1% 


square-7-b 


10166.24 


7 731557 


+2.4% 


-14.5% 


-6.8% 


-14.7% 


square-7-v 


5 690.00 


13 956 982 


-4.1% 


-16.9% 


-4.3% 


-16.9% 


warehouse 


0.74 


2 486 


-2.5% 


-1.8% 


-7.9% 


-9.1% 


average (all) 






-1.6% 


-8.9% 


-5.6% 


-11.5% 
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Table 2: Fixpoint reasoning experiments with propagator priorities. 



n , v a m r\ 1 £> 

1 J A nil 1 11M1 , 


static 
time steps 


va y ±iiiL±iiiv_. 

time steps 


all-interval-500 


o noy o 1 no/ 
— z.y/o — 24.y/o 


— o ( . ( /o — io.U/o 


alpha 


rr a 0/ ik /?0y 

— 5.4/0 — 15.D70 


a aOV 1 c coy 
— 4.O/0 — 15.D/0 


bibd-7-3-60 


n '70/ a coy 

— U.7/0 — O.5/0 


n -vO/ e coy 
— U.7/0 — D.O/0 


cars 


o ooy _i_r\ nO/ 

— U.2/0 ±U.U/o 


+U.1/0 ±U.U/o 


crowded-chess-7 


n ^oy _i_n nO/ 

— U.7/0 ±U.U/o 


r\ coy in r\0y 

— U.o/o ±U.U/o 


donald-b 


—4.8/0 —22.9/0 


i i oG7 ori /i oy 

— 11.870 —29.470 


donald-d 


—6.7% —5.17o 


r- r-j(Yf no oOy 

— 6.7/0 —28.8/0 


donald-v 


C CO/ ic CO/ 
— 5.O70 —lb. 070 


c ooy 1 ^ coy 
— 5.-3/0 —lb. 5/o 


golomb-10-b 


yi no/ oo c07 
— 4.9/0 —23.070 


c coy oo roy 
— D.5/0 —2,3.5/0 


golomb— 10 — d 


— o.o/o — ZO.^i/O 


— o.uyo — zo.q/o 


graph - color 


+0.4% ±0.0% 


-7.3% -4.9% 


grocery- 


-4.9% -29.5% 


-4.9% -29.5% 


knights- 10 


±0.0% +1.3% 


+0.2% +1.3% 


minsort-200 


-10.9% -9.1% 


-0.9% -9.2% 


o-latin-7-d 


+0.3% -1.4% 


-6.1% -22.9% 


partition-32 


-8.5% -30.4% 


-11.8% -30.9% 


photo 


-0.7% -1.2% 


-7.5% -2.6% 


picture 


+6.5% -13.1% 


+4.0% -20.5% 


queens-400 


+0.2% ±0.0% 


+0.2% ±0.0% 


queens-400-a 


-1.3% -16.2% 


-1.3% -16.2% 


sequence-500 


+0.1% ±0.0% 


+0.1% ±0.0% 


square-5-d 


-6.4% -11.1% 


-7.3% -15.8% 


square-7-b 


-2.5% -25.2% 


-10.0% -26.0% 


square-7-v 


-6.6% -24.9% 


-5.8% -24.9% 


warehouse 


-2.2% -1.4% 


-7.9% -9.8% 


average (all) 


-2.9% -12.7% 


-6.1% -15.9% 



to static reasoning. This is in particular true for examples donald-b and 
square-7-b where a bounds-consistent alldif f erent constraint can take ad- 
vantage of reporting whether propagation has computed a fixpoint due to no 
domain holes as discussed in Example 14.61 (all-interval-500 shows the same 
behavior due to the absolute value propagator used) . 

Influence of propagation order Some examples show a considerable in- 
crease in runtime (in particular, golomb-10-b and golomb-10-d). This is due 
to the fact that the order in which propagators are executed changes: costly 
propagators are executed often while cheap propagators are executed less often 
(witnessed by the decrease in propagation steps). 

Section [5] presents priorities that order execution according to propagator 
priorities. For now it is sufficient to note that when these priority inversion 
problems are avoided through the use of priorities, there is no increase in run- 
time. Table [2] provides evidence for this. The table shows relative runtime 
and propagation steps (relative to no fixpoint reasoning as in Table [T]) . When 
avoiding priority inversion it becomes clear that both static as well as dynamic 
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fixpoint reasoning consistently improve the number of propagation steps and 
also runtime. The only exception is picture where even with priorities the 
considerable reduction in execution steps does not translate into a reduction in 
runtime. This is possibly due to a change in propagation order that affects run- 
time (that this can have a remarkable effect even with priorities is demonstrated 
in Section O. 

5 Event Reasoning 

The next improvement for avoiding propagators to be put in the queue is to 
consider what changes in domains of input variables can cause the propagator 
to no longer be at a fixpoint. To this end we use events: an event is a change 
in the domain of a variable. 

Assume that the domain D changes to the domain D' C D, A typical set of 
events defined in a constraint propagation system are: 

• fix(x): the variable x becomes fixed, that is \D'(x)\ = 1 and |.D(x)| > 1. 

• lbc(x): the lower bound of variable x changes, that is inf^/ x > infjj x. 

• ubc(cc): the upper bound of variable x changes, that is sup^, x < swp D x. 

• dmc(x): the domain of variable x changes, that is D'(x) C D{x). 

Clearly the events overlap. Whenever a fix(:r) event occurs then a lbc(x) event, 
a ubc(x) event, or both events must also occur. If any of the first three events 
occur then a dmc(x) event occurs. These events satisfy the following property. 

Definition 5.1 [Event] An event is a change in domain defined by an event 
condition <j>(D, D') which states that event <f> occurs when the domain changes 
from D to D' CD. The event condition must satisfy the following property 

<f>(D, D") = <f>(D, D') V 4>(D', D") 

where D" C D' C D. So an event occurs on a change from D to D" iff it occurs 
in either the change from D to D' or from D' to D" . 

Given a domain D and a stronger domain D' C D, then events(D, D') is the 
set of events <f> where cj>(D,D'). Suppose D" C. D' C D, then clearly 

events(£>, D") = cvcnts(£), D') U events(Z)', D"). (1) 

Most integer propagation solvers use the events defined above, although 
many systems collapse ubc(x) and lbc(x) into a single event hc{x) (for exam- 
ple, SICStus PI], ILOG Solver [T7], and Gecode [H]). Choco 19J maintains 
an event queue and interleaves propagator execution with events causing more 
propagators to be added to the queue. 

Other kinds of events or variants of the above events are also possible. For 
example, (for domains D and D' with D' □ D): 
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• bc(x): as discussed above (lbc(a;) V ubc(x)). 

• two(x): the variable x reduces to a domain of at most two values: \D'(x)\ < 
2 and \D(x)\ > 2. 

• ran(x): the variable x reduces to a domain that will always be a range 
(two consecutive values or a single value) : sup D / x — inf d> x < 1 and 
s\rp D x — infu > 1. 

• pos(x): the variable x reduces to a domain that is strictly positive, that 
is inf^, x > and infu x < (likewise, an event neg(x) for reduction to a 
strictly negative domain). 

• nneg(x): the variable x reduces to a domain that is non- negative: inf'^, x > 
and inf d x < (likewise, an event npos(x) for reduction to a non-positive 
domain) . 

• neq(x,<i): the variable x can no longer take the value d, that is d £ D(x) 
and d g D'(x) 

The events two and ran are useful for tracking endpoint-relevance and range- 
equivalence [31] ■ The neq event has been used in e.g. Choco [TH] and B- 
Prolog [37] for building AC4 [23] style propagators. 

Example 5.2 [Events] Let D(xi) = {1,2,3}, D(x 2 ) = {3,4,5,6}, D(x 3 ) = 
{0, 1}, and D{ Xi ) = {7, 8, 10} while D'(xi) = {1, 2}, D'(x 2 ) = {3, 5, 6}, D'(x 3 ) = 
{1} and D'(x 4 ) = {7, 8, 10}. Then events(L>, D') is 

{ubc(a;i), dmc(xi), dmc^), ^(x^), lbc(xa), dmc(i3)} 

Considering the additional events we obtain in addition 

{bc(xi), two(xi), ran(xi), bc(x 3 ), neq(xi, 3), neq(a; 2 , 4), neq(x3, 0)} 

□ 

Example 5.3 [Events are monotonic] Events are monotonic: further changes 
to a domain do not discard events from previous changes. Consider the property 
range(x) capturing that D(x) is a range for a domain D (this property is related 
to the event ran(x), lacking the restriction that the domain can have at most 
two elements). 

The property range(x) is not an event: consider domains D" C D' C D 
withZ>(x) = {1,2,3,5}, D'(x) = {1,2,3}, and D"(x) = {1,3}. If range were 
an event, then events(D, D") = events(D, D') U events(-D', D"). However, 

events(£>, D") = {dmc(x), ubc(x)} 

whereas 

events(D, D') U events(£>', D") = {dmc(x), ubc(x), range(x)} U {dmc(x)} 

□ 
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5.1 Static Event Sets 

Re-execution of certain propagators can be avoided since they require certain 
events to generate new information. 

Definition 5.4 [Propagator dependence] A propagator / is dependent on 
a set of events es(/) iff 

(a) for all domains D if f(D) ^ /(/(£>)) then events(D, /(£))) n es(/) ^ 0, 

(b) for all domains D and D' where f(D) = D, D' C D and /(£>') ^ £>' then 
events(L>, £>') n es(/) ^ 0. 

The definition captures the following. If / is not at a fixpoint then one of 
the events in its event set occurs. If / is at a fixpoint D then any change to a 
domain that is not a fixpoint D involves an occurrence of one of the events in 
its set. Note that for idempotent propagators the case (a) never occurs. 

For convenience later we will store the event set chosen for a propagator / 
in an array evset [/] . 

Clearly, if we keep track of the events since the last invocation of a propaga- 
tor, we do not need to apply a propagator if it is not dependent on any of these 
events. 

Example 5.5 [Event sets] Event sets for previously discussed propagators are 
as follows: 

f A {ubc(x 2 )} 

]b {lbc(ari)} 

/b {dmc(a;i),dmc(x2)} 

f F {lbc(xi), ubc(xi), lbc(x 2 ), ubc(x 2 )} 

This is easy to see from the definitions of these propagators. If they use inf^i x 
then lbc(a;) is in the event set, similarly if they use sup^ x then ubc(x) is in the 
event set. If they use the entire domain D(x) then dmc(a;) is in the event set. 
□ 

Indexical propagation solvers [34[ [9l [6] are based on such reasoning. They 
define propagators in the form f(D)(x) = D{x) D e(D) where e is an indexical 
expression. The event set for such propagators is automatically defined by the 
domain access terms that occur in the expression e. 

Example 5.6 [Indexical] An example of an indexical to propagate x% > x 2 + 1 
is 

x\ £ [inf(£ 2 ) + 1 .. + oo] 

22 £ [ — OO .. SUp(xi) — 1] 

These range expressions for indexicals define two propagators: 

fi(D)( Xl ) = D(xi) n [inffca) + 1 •• + oo] 

fi(D)(x) = D(x) x^ Xl 

fj{D)(x 2 ) = D(x 2 ) n [-co .. sup^O - 1] 

}j{D){x) - D(x) x^x 2 
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The event set for the propagator // from the definition is {lbc(x2)} while 
the the event set for fj is {ubc(a;i)}. □ 



Using events we can define a much more accurate version of new that only 
adds propagators for which one of the events in its event set has occurred. 

new cvcnts (/, F, D, D') = {/' G F | evset[/'] n events (£>, D 1 ) ± 0} - fix(/, D) 

This version of new (without dynamic fixpoint reasoning) roughly corresponds 
with what most constraint propagation systems currently implement. 

Proposition 5.7 new cvcn t s maintains the invariant f(D) = D for all f G F—Q 
at the start of the while loop. 

Proof: Consider f'sF — Q — {/} different from the selected propagator /. 
Then f'(D) = D and if f'(D') ^ D' then events(L>, D')nes(f) ^ by case (b) 
of the definition of es(f'), so /' G Q at the start of the loop. 

Consider selected propagator /. This is removed from Q, but if f(D') ^ D' 
then events(D, D') n es(/) ^ by case (a) of the definition of es(/). Clearly 
also fix(/, D) 0. So / G Q at the start of the loop. □ 

5.2 Event Set Experiments 

Table [3] shows runtime and number of propagation steps for different event 
sets relative to a propagation engine not using events (the engine uses dynamic 
fixpoint reasoning but no priorities). The row "average (above)" gives the geo- 
metric mean of the relative numbers given in the table whereas "average (all)" 
shows the relative numbers for all examples. 

General observations A first, quite surprising, observation is that using no 
events at all is not so bad. It is the best approach for 10 out of the 25 benchmarks 
and for crowded-criess-7 by a considerable margin (between 8.3% and 38.7%). 
Another general observation is that a reduction in the number of propagation 
steps does not directly translate into a reduction in runtime. This is due to the 
fact that all saved propagator executions are cheap: the propagator is already 
at fixpoint and does not have to perform propagation. 

Note that the ratio between reduction in steps and reduction in runtime 
ultimately depends on the underlying system. In Gecode, the system used, the 
actual overhead for executing a propagator is rather low. In systems with higher 
overhead one can expect that the gain in runtime will be more pronounced. 

Event set observations Adding the fix event is particularly beneficial for 
benchmarks with many disequalities, particularly queens-400 which only uses 
disequalities. 

Adding the be event has significant benefit (up to 5%) when there are linear 
equalities as in alpha or bounds(Z) consistent alldif f erent as in photo. But 
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Table 3: Event set experiments (runtime and propagation steps). 



Example 


only fix, dmc 
time steps 


with be 
time steps 


with lbc, ubc 
time steps 


all-interval-500 


+0.6% ±0.0% 


+1.0% ±0.0% 


+2.0% ±0.0% 


alpha 


-2.0% -4.4% 


-7.3% -19.2% 


-6.3% -19.3% 


bibd-7-3-60 


-2.6% -16.6% 


-1.4% -16.6% 


+7.4% -15.8% 


cars 


+1.1% -0.2% 


+ 1.7% -0.2% 


+2.6% -0.2% 


crowded-chess-7 


+8.3% ±0.0% 


+20.2% -8.4% 


+38.7% -8.4% 


donald-b 


+0.3% ±0.0% 


-2.1% -9.2% 


-1.3% -9.2% 


donald-d 


±0.0% ±0.0% 


+0.9% ±0.0% 


-0.1% ±0.0% 


donald-v 


+0.6% -10.5% 


-1.4% -19.7% 


+0.1% -19.7% 


golomb-10-b 


-0.6% ±0.0% 


±0.0% -3.9% 


-0.5% -3.9% 


golomb-10-d 


+0.2% ±0.0% 


+26.4% -3.5% 


+26.5% -3.5% 


graph-color 


-0.7% -47.7% 


+0.1% -49.5% 


+0.6% -49.5% 


grocery 


-0.1% ±0.0% 


±0.0% ±0.0% 


±0.0% ±0.0% 


knights-10 


-12.7% -47.5% 


-10.8% -48.6% 


-7.7% -48.6% 


minsort-200 


+0.4% ±0.0% 


+1.1% ±0.0% 


+1.5% ±0.0% 


o-latin-7-d 


-1.2% ±0.0% 


+0.2% -1.7% 


+1.5% -1.7% 


partltion-32 


+0.7% ±0.0% 


+15.1% +3.4% 


+18.8% +17.7% 


photo 


+ 1.6% ±0.0% 


-3.3% -27.1% 


-2.0% -26.9% 


picture 


+2.0% ±0.0% 


+3.5% ±0.0% 


+11.5% ±0.0% 


queens-400 


-87.5% -99.1% 


-87.5% -99.1% 


-87.5% -99.1% 


queens-400-a 


-10.4% -38.8% 


-9.6% -38.8% 


-8.2% -38.8% 


sequence-500 


+2.2% ±0.0% 


+3.5% +36.3% 


+8.8% +36.3% 


square-5-d 


+0.8% ±0.0% 


+2.7% ±0.0% 


+2.6% +0.5% 


square-7-b 


-0.2% +0.5% 


+0.6% -8.2% 


+0.7% -7.8% 


square-7-v 


-1.0% -8.9% 


-1.7% -16.9% 


+1.2% -16.9% 


warehouse 


+2.2% +0.2% 


+3.3% -7.6% 


+4.2% -6.6% 


average (all) 


-8.5% -24.3% 


-6.7% -26.9% 


-4.6% -26.4% 



18 



Table 4: Event set experiments with priorities (runtime and propagation steps). 



Example 


only fix, dmc 


with be 


with lbc, ubc 




time 


steps 


time 


steps 


time 


steps 


bibd-7-3-60 


+6.1% 


-3.3% 


+0.6% 


-3.3% 


+13.1% 


-2.5% 


crowded-chess-7 


+8.0% 


±0.0% 


+20.4% 


-0.7% 


+38.0% 


-0.7% 


donald-b 


+0.9% 


±0.0% 


-1.1% 


-9.2% 


-0.7% 


-9.2% 


donald-d 


-0.3% 


±0.0% 


+0.1% 


±0.0% 


-0.2% 


±0.0% 


golomb-10-b 


+0.1% 


±0.0% 


-0.9% 


-5.5% 


-0.9% 


-5.5% 


golomb-10-d 


+0.1% 


±0.0% 


-0.5% 


-7.8% 


±0.0% 


-7.8% 


minsort-200 


+0.9% 


±0.0% 


+1.5% 


±0.0% 


+1.8% 


±0.0% 


partition-32 


+1.9% 


-1.2% 


+1.8% 


-1.8% 


+1.0% 


-0.4% 


picture 


+2.6% 


±0.0% 


+3.7% 


±0.0% 


+1.6% 


±0.0% 


square-5-d 


+0.9% 


±0.0% 


+3.2% 


±0.0% 


+3.2% 


+0.2% 


square-7-b 


+0.4% 


-0.1% 


+0.2% 


-10.6% 


+1.2% 


-10.6% 


average (above) 


+1.9% 


-0.4% 


+2.5% 


-3.6% 


+4.8% 


-3.4% 


average (all) 


-7.8% 


-24.1% 


-7.8% 


-27.8% 


-6.3% 


-27.7% 



for other examples, where one would expect some reduction in runtime, the over- 
head (to be discussed in more detail below) for maintaining a richer event set 
exceeds the gains from reducing the number of propagation steps. This is true for 
examples such as minsort-200 (minimum propagators), partition-32 (multi- 
plication), and golomb-10-b and square-7-b (linear equations and bounds(Z) 
consistent alldif f erent). 

Splitting the be event into lbc and ubc events exposes the overhead once 
more. There is almost never an improvement in number of propagations, since 
only inequalities can actually benefit, and there is substantial overhead. 

Influence of propagation order Similar to using fixpoint reasoning, the 
use of events also changes the order in which propagators are executed. Table H] 
reconsiders all examples that could possibly benefit from be or lbc, ubc events 
and all examples that show a remarkable increase in number of propagation 
steps in Table El 

The numbers confirm that with priorities no considerable increase in runtime 
can be observed for all but crowded-chess-7, where the increase in runtime here 
is due to a change in propagation order to the introduction of event sets that 
does not depend on the relative priorities of the propagators used. 

Once priorities are used, lbc and ubc are basically never beneficial. 

Memory requirements Table [5] shows the total memory allocated for dif- 
ferent event sets relative to a propagation engine not using events (the engine 
uses dynamic fixpoint reasoning and priorities). It is important to note that the 
memory figures reflect the amount of memory allocated which is bigger than 
the amount of memory actually used. In particular, for small examples such 
as donald-* the increase in allocated memory just reflects the fact that an 
additional memory block gets allocated. 
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Table 5: Event set experiments with priorities (allocated memory). 



Example 


no events 


only fix, dmc 


with be 


with lbc, ubc 




mem (KB) 


mem 


mem 


mem 


bibd-7-3-60 


6 690.1 


+2.9% 


±5.7% 


±14.4% 


crowded-chess-7 


198.2 


+4.0% 


±13.1% 


±21.2% 


donald-b 


7.8 


±0.0% 


±12.8% 


±12.8% 


donald-d 


3.4 


±0.0% 


±58.2% 


±58.2% 


donald-v 


5.8 


±34.3% 


±34.3% 


±51.5% 


golomb-10-b 


39.7 


±2.6% 


±10.1% 


±12.7% 


golomb-10-d 


37.7 


±0.0% 


±2.8% 


±16.0% 


minsort-200 


32 419.1 


±1.1% 


±2.3% 


±17.4% 


partition-32 


160.3 


±3.7% 


±9.4% 


±17.5% 


picture 


451.6 


±17.7% 


±17.7% 


±30.1% 


queens-400 


30 286.6 


±0.2% 


±0.2% 


±0.2% 


queens-400-a 


567.0 


±2.8% 


±4.2% 


±4.2% 


average (above) 




±5.4% 


±13.3% 


±20.3% 


average (all) 




±3.9% 


±9.9% 


±15.5% 



Using just a fix event increases the required memory by less than 4% in 
average, while using full event sets increases the required memory by 15.5% in 
average. 

It must be noted that Gecode (the system used) has a particularly efficient 
implementation of event sets: the implementation uses a single pointer for an 
entry in an event set; entries for the same variable and the same propagator 
but with different events still use a single pointer. The memory overhead is due 
to the fact that per variable and supported event, one single pointer for book- 
keeping is needed. Hence, the highest overhead can be expected for examples 
with many variables but relatively few propagators and small event sets (such 
as bibd-7-3-60, crowded-chess-7, and picture). The overhead becomes less 
noticeable for examples with many propagators or large event sets and few vari- 
ables (such as queens-400 and queens-400-a). 

Summary In summary, while there is a compelling argument for fix events, 
there is only a weak case for be being supported, and lbc and ubc should not 
be used. 

5.3 Dynamic Event Sets 

Events help to improve the efficiency of a propagation-based solver. Just as we 
can improve the use of fixpoint reasoning by examining the dynamic case, we 
can also consider dynamically updating event sets as more information is known 
about the variables in the propagator. 

Monotonic event sets 
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Definition 5.8 [Monotonic propagator dependence] A propagator / is 
monotonically dependent on a set of events es(/, D) in the context of domain D 
iff 

(a) for all domains D C D if /(D> ) ^ /(/(-Do)) then events (A), /(-Do)) n 
es(/,£>)^0, 

(b) for domains Do and Di where -Do E -D, /(-Do) = -Do, -Di C -Do and 
/(Di) / Dr then events(-D , D x ) n es(/, D) ^ 0. 

Clearly given this definition es(/, D) is monotonically decreasing with D. 
The simplest kind of event reduction occurs by subsumption. 

Definition 5.9 [Subsumption] A propagator / is subsumed for domain D, if 
for each domain D' C D we have f(D') = D' . 

A subsumed propagator makes no future contribution. If / is subsumed 
by D then es(/, D) = and / is never re-applied. Most current constraint 
propagation systems take into account propagator subsumption. 

Example 5.10 [Subsumption] Consider the propagator /^ and the domain 
D with D{xi) = [1 .. 3] and D(a;a) = [3 .. 7]. Then the constraint holds for all 
D' C D and es(/, D) = 0. □ 

Changing event sets can occur in cases other than subsumption. 

Example 5.11 [Minimum propagator] Consider the propagator fx for xo — 
mm(a:i,X2) defined by 

f K (D)(x ) = D(x ) n [min(inf£)Xi,inf D a;2) .. min(sup D Xi,sup D x 2 )] 
f K (D){x t ) = D( Xi ) n [w£ D *o •• + oo] i£ {1, 2} 

f K (D)(x) = D(x) x^{x ,xi,x 2 } 

The static event set es(fx) is {lbc(xo), lbc(ari), ubc(xi), lbc{x2), ubc(x2)}. Note 
that this propagator is idempotent. 

But given domain D where D{xq) = [1 .. 3] and D(x2) — [5 .. 7] we know that 
modifying the value of X2 will never cause propagation. A minimal definition of 
es(/if,-D) is {lbc(xo), lbc(xi), ubc(xi)}. □ 

Example 5.12 [exactly propagator] Another example is a propagator for 
the exactly constraint [35] : exactlyQxi, . . . ,x n ],m, k) states that exactly m 
out of the variables x\, . . . ,x n are equal to a value k. As soon as one of the Xi 
becomes different from fc, all events for Xi can be ignored. Originally the events 
are dmc(xi), 1 < i < m, lbc(m), ubc(m) and dmc(fc). 

Suppose a domain D where D{k) = {1, 3, 8} and -D(x 3 ) = {2, 5, 6, 10, 11, 12}, 
then X3 ^ k and we know its contribution to the exactly constraint. We can 
remove the event dmc(xa) from the event set safely. □ 



21 



Other examples for monotonic event sets are propagators for the lex con- 
straint and the generalized element constraint. When using a variant of the lex 
propagator proposed in [5], events can be removed as soon as the order among 
pairs of variables being compared can be decided. For the generalized element 
constraint [7] where the array elements are variables, events for a variable from 
the array can be safely removed as soon as the variable becomes known to be 
different from the result of the element constraint. 

Using monotonic dynamic event sets we can refine our definition of new as 
follows. 

new mevents (/,F, D,D') 

F' := {/' G F | evset[/'] n events(L>, D')} - 6x(f, D) 
evset[/] := es(f,D>) 
return F' 

Every time a propagator / is applied its event set is updated to take into account 
newly available information. 

A related idea is the "type reduction" of [3U] where propagators are im- 
proved as more knowledge on domains (here called types) becomes available. 
For example, the implementation of xq — x± x X2 will be replaced by a more 
efficient one, when all elements in D{x\) and D(x2) are non-negative. Here we 
concentrate on how the event sets change. The two ideas could be merged as 
they are complementary. 

Proposition 5.13 new movon t s maintains the invariant f(D) = D for all f € 
F — Q at the start of the while loop. 

Proof: The proof is almost identical to that for Proposition 15.71 since we are 
working in a context if evset[/] = es(f,D*) then D C D* . 

For propagators using the monotonic event sets, we have the invariant that 
/ € F — Q iff evset[/] = es(f,D*) where D* is the result of the last time we 
executed propagator /. 

Suppose we have /' 6 F - Q - {/} where /'(£>') ^ D' and f(D) = D then 
events(L>*, D') n cs(/, D*) ^ and since f'^Qwe have that events(D*, D) n 
es(/, D*) = otherwise we would have placed / in the queue already, hence by 
the equation (JTJ) events(Z), D')<les(f, D*) ^ so /' E Q at the start of the loop. 

Consider selected propagator /. Then it is removed from Q but if f(D') ^ D' 
then clearly fix(/, D) = and using case (a) of Definition 15.81 we have that 
events(L>, D') n es(/, D) ^ 0. Hence / G Q at the start of the loop. □ 

Fully dynamic event sets Note that for many propagators we can be more 
aggressive in our definition of event sets if we allow the event sets to change in 
a manner that is not necessarily monotonically decreasing. 

Definition 5.14 [General propagator dependence] A propagator / is de- 
pendent on a set of events es(/, D) in the context of domain D if for all domains 
Di where DiCD and f(Di) ^ D x then events(L>, D x ) n es(/, D) ^ 0. 
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Using fully dynamic event sets we can refine our definition of new as follows. 

new devents (f,F,D,D') 

F' := {/' G F | evset[/'] n events(L>, D')} - fbc(/, D) 
if (fix(/, D) = 0) 

F' := F' U {/} 
evset[/] := es(/, £>') 
return F' 

Every time a propagator / is applied its event set is updated to take into 
account newly available information. The only difficult case is that if the def- 
inition of dynamic dependency does not capture the events that occur when 
moving from D to D' = f(D). If fixpoint reasoning cannot guarantee a fixpoint 
we need to add / to the queue. 

Fully dynamic event sets are more powerful than monotonically decreasing 
event sets, but in general they require reasoning about the new event sets each 
time the propagator is run. 

Example 5.15 [Fully dynamic events for minimum] Given the propagator 
fic from Example 15.111 and the domain D where D(xq) = [0 .. 10], D(x-y) — 
[0 .. 15], and D(x>}) — [5 .. 10]. D is a fixpoint of fx and a minimal set es(/if, D) 
is 

{lbc(xo), lbc(a;i), ubc(xi), ubc^)} 

While at D' C D where D'(x ) = [5 .. 9], D'{xt) = [6 .. 9], and D'(x 2 ) = 
[5 .. 10], which is also a fixpoint, the minimal set es(/if , D') is 

{lbc(a;o), ubc(xi), lbc(x 2 ), ubc(x 2 )} 

For the constraint ck we simply need to maintain a lbc event for some 
variable Xi in the right hand side with the minimal lbc value. □ 

Proposition 5.16 riew,jevents maintains the invariant f(D) = D for all f £ 
F — Q at the start of the while loop. 

Proof: We have the invariant that / e F - Q iff evset[/] = es(/, D*) where D* 
is the result of the last time we executed propagator /. 

Suppose we have f E F - Q - {/} where /'(£>') ^ D' and f(D) = D then 
events(Z)*, D') n es(/, D*) ^ and since f'^Qwe have that events(D*, D) n 
es(f,D*) = otherwise we would have placed /' in the queue already, hence 
by the equation (TT]) events(£>, D 1 ) n es(/, D*) ^ so /' G Q at the start of the 
loop. 

Consider the selected propagator /. The same reasoning cannot apply since 
even though we know that events(D*, D') l~les(/, D* ) ^ 0, we have no guarantee 
that events(-D, D') R es(/, D*) is not empty. Now if fix(/, D) = then possibly 
/(£>') ^ D', but this will force / G Q by the start of the loop. □ 

Effectively the fully dynamic event sets approach relies only on the dynamic 
fixpoint reasoning of the propagator / to handle what happens when moving 
from D to D' . 
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Table 6: Dynamic event sets experiments (with priorities). 



Example 




monotonic 






: ully dynamic 






time 


steps 


mem 


time 


steps 


mem 


bibd-7-3-60 


+0.4% 


±0.0% 


±0.0% 


-5.0% 


-5.2% 


±0.0% 


crowded-chess-7 


-24.0% 


±0.0% 


-15.2% 


-28.2% 


-34.2% 


-17.8% 


sequence-500 


-82.3% 


-78.3% 


-49.5% 


-82.1% 


-78.3% 


-49.5% 


o-latin-7-d 


-0.7% 


±0.0% 


±0.0% 


-1.7% 


±0.0% 


±0.0% 


average (above) 


-39.5% 


-31.8% 


-19.1% 


-41.1% 


-39.3% 


-19.7% 


average (all) 


-10.8% 


-8.2% 


-4.1% 


-11.2% 


-9.9% 


-4.2% 



Fully dynamic event sets are closely related to the watched literals approach 
to improving unit propagation in SAT solving [25]. Using watched literals, unit 
propagation only considers a clause for propagation if one of two watched literals 
in the clause becomes false. Recently, the idea of watched literals has been used 
in constraint programming for the Minion solver [15] . Watched literals differ 
from the events we concentrate on here since they take into account values 
(similar to the neq(x, a) event). Note that dynamic event sets do not usually 
have the property of watched literals, in that they do not need to be updated 
on backtracking. 

5.4 Dynamic Event Sets Experiments 

Table [6] shows the comparison of monotonic and fully dynamic event sets to a 
propagation solver using static event sets with {fix, be, dmc} events and priori- 
ties to avoid priority inversion as discussed before. The table lists only examples 
where dynamic event sets are used. 

The propagators using monotonic event sets are as follows: for bibd-7-3-60: 
lex; for crowded-chess-7: exactly and element; for sequence-500: exactly; 
for o-latin-7-d: lex. Clearly, monotonic event sets lead to a drastic reduction 
in both runtime and memory usage, where it is worth noting that the reduction 
in time is even more marked than the reduction in propagation steps (each 
propagation step becomes cheaper as smaller event sets must be maintained) . 

Fully dynamic event sets are considered in Boolean-sum propagators used in 
bibd-7-3-60 and crowded-chess-7. The difference in improvement between 
the two examples can be explained by the fact that crowded-chess-7 uses 
Boolean-sums as inequalities while bibd-7-3-60 uses Boolean-sums as equalities 
where inequalities offer the potential for considerably smaller event sets [15] . 

Example bibd-7-3-60 provides another insight: the drastic reduction in 
runtime observed in [13] by using watched literals for Boolean-sum propagators 
in the Minion solver is most likely not due to small event sets but to other 
aspects. A possible aspect is the knowledge about which variables have been 
modified when a propagator is executed. 
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Table 7: Queue versus stack experiments. 



1 1 A fli [ 1 1 L / 1 1 . 


queue 


st cick 






time (ms) 


steps 


time 


steps 


all-interval-500 


73.00 


377 777 


+ 1OOZO.U70 


+OZ.Z70 


alpha 


97.31 


207 470 


+47.5% 


+99.0% 


bibd-7-3-60 


2 020.30 


1020162 


+7.7% 


+ 102.7% 


cars 


4.65 14 860 


+1.3% 


CO/ 

+2.670 


crowded-chess-7 


ooo.zo 


DDU OZO 


+4.7/0 


+68.1% 


donald-b 


0.62 


414 


+ 00.070 


+31.9% 


donald-d 


28.53 


40 


r: a 07 

.4/o 


+7.5% 


donald-v 


0.36 


366 


+ 7.D70 


+2,6.1/0 


golomb-10-b 


1342.48 


2 095161 


+870. 0% 


+171.5% 


golomb - 10 — d 


3 201.84 


2 091691 


— ii.y/o 


1 no nor 
i-yo.y/o 


graph - color 


33.54 


4 422 


+299.7% 


+ 118.4% 


grocery 


56.87 


2 211 


+44.2% 


-6.3% 


knights-10 


6.66 


25 225 


+7.7% 


+53.3% 


minsort-200 


141.84 


113 437 


+1580.1% 


1-1123.3% 


o-latin-7-d 


532.48 


311 135 


+4.9% 


+24.9% 


partition-32 


9 001.24 


13 939101 


-18.5% 


+34.8% 


photo 


90.37 


296 661 


+11.1% 


+8.7% 


picture 


1583.12 


119 406 


+27.1% 


+79.9% 


queens-400 


549.36 


268 771 


±0.0% 


-0.1% 


queens-400-a 


14.40 


1265 


-0.9% 


-0.3% 


sequence-500 


98.12 


56 048 


+53.8% 


+240.9% 


square-5-d 


32 263.12 


1 478 403 


+71.5% 


+57.7% 


square-7-b 


9 491.84 


6 054 656 


+75.3% 


+44.9% 


square-7-v 


5 345.00 


9 636 675 


+8.3% 


+35.2% 


warehouse 


0.70 


2 075 


+ 11.7% 


+29.3% 


average (all) 






+78.4% 


+61.0% 



6 Which Propagator to Execute Next 

We now address how to define which propagator / in the queue Q should execute 
first, that is how to define the choose function. 

The simplest policy to implement is a FIFO (First In First Out) queue 
of propagators. Propagators are added to the queue, if they are not already 
present, and choose selects the oldest propagator in the queue. The FIFO 
policy ensures fairness so that computation is not dominated by a single group 
of propagators, while possibly not discovering failure (a false domain) from other 
propagators quickly. 

The equally simple LIFO (Last In First Out) policy is a stack where prop- 
agators not already in the stack are pushed, and choose selects the top of the 
stack. 
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6.1 Basic Queuing Strategy Experiments 

Table [7] compares using a FIFO queue versus a LIFO stack. The result, accord- 
ing with folklore knowledge, clearly illustrates that a queue must be used. The 
few cases where a stack is better are comprehensively outweighed by the worst 
cases for a stack. 

Later experiments in Section [6.31 using priorities with combinations of FIFO 
queues and LIFO stacks reveal that the pathological behavior of all-interval-500 
is due to priority inversion. However, the pathological behavior of minsort-200 
is due to the use of a LIFO stack. 

6.2 Static Priorities 

A statically prioritized queue associates with each propagator a fixed priority, 
we will assume an integer in the range [0 .. k — 1]. In effect, the queue Q is 
split into k queues, Q[0], . . . Q[k — 1] where each Q[i] is a FIFO queue for the 
propagators with priority i. Selection always chooses the oldest propagator in 
the lowest numbered queue Q[i] that is non-empty. Static prioritization allows 
one to ensure that quick propagators are executed before slow propagators. 

We give an example of seven static priorities, with names of the integer pri- 
orities as follows: unary=0, binary=l, ternary=2, linear=3, quadratic=4, 
cubic=5, and veryslow=6. The names are meant to represent the arity of the 
constraint, and then the asymptotic runtime of the propagator, once the con- 
straint can handle n variables. So binary is for binary constraints, quadratic 
is for constraints that are approximately 0(n 2 ) for instances with n variables. 

Example 6.1 [Propagator priorities] For example, the propagator Jl for 
even(xi) defined by 

f L {D){ Xl ) = D{xx)ri[2\\voiDx{\..2\_\sa 9D {xx)\] 
f L (D)(x) = D{x) x^x Y 

might be given priority unary, while /e and fp might be given priority binary. 

The domain propagator defined in [29] for the alldifferent constraint 
A" = i A™ =i+1 i£i Xj (with complexity 0(n 2 5 )) might be given priority quadratic. 

The alldifferent bounds(Z) propagator defined in [28] (with complexity 
0(n log n)) might be given priority linear. □ 

Priorities in effect force many more fixpoints to be calculated. A fixpoint of 
all propagators at priority level i and lower must be reached before a propagator 
at priority level i + 1 is run. This means we will often cause more propagators 
to run when using priorities, but more cheap propagators! 

Example 6.2 [Repeated fixpoints] Consider the execution of a system of 
propagators for constraints cc = x% — 2x2, cd = %i = 3^2, cm = £2 < 
6 — » x\ < X3 + 7 and cat = alldif f erent[iEi, x%, X3, £4, X5]. We will use the 
bounds (M) propagators fc, /d, Im for the first three constraints, and the do- 
main propagator, /at, for alldifferent from [29] . for the last constraint. Let 
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the initial domain be D{x 1 ) = [0 .. 18], D(x 2 ) = [0 .. 9], D(x 3 ) = [0 .. 6], and 
D(xi) = D(x§) = [0 .. 3]. The priorities are binary, binary, ternary, and 
quadratic respectively. All propagators are at fixpoint. 

Suppose the domain of x% changes to [0 .. 17]. All propagators are enqued. 
The priority level binary propagators are run to fixpoint (as in Example 13. 2p 
giving D(xi) = [0 .. 12], D{x 2 ) = [0 .. 6], D(x 3 ) = [0 .. 4] Then f M is sched- 
uled and causes D(xi) = [0 .. 11]. Both fc and fn are enqued, and after 
executing fc, /d, fc and fr> the next fixpoint of the binary priority prop- 
agators is reached: D(xi) = [0 .. 6], D(x 2 ) = [0 .. 3], D(x 3 ) = [0 .. 2]. Then 
Jm is scheduled again and causes no change. After that, /jv is executed, it 
reduces the domains of D(x\) = [4 .. 6] since all the values [0 .. 3] are required 
for the variables x 2 , x 3 , X5. Both fc and fn are enqued, and after execut- 
ing we reach their fixpoint D(x{) — {6}, D{x 2 ) — {3}, D(x 3 ) = {2}. Once 
again /m is executed for no change. Then fx is executed once more obtaining 
D{xa) — D{x§) — {0, 1}. This is the overall fixpoint. □ 

We can adjust the granularity of the priorities: we can have a finer version 
of the above priorities with 14 priorities, each priority above with a low and 
high version. This allows us to separate, for example, a domain consistent 
propagator for the binary absolute value constraint abs(ir) = y (binary-low) 
from a bounds consistent propagator for the same constraint (binary-high). 
This increased granularity will be important in Section [7] 

Conversely, we may collapse the priorities into fewer levels, for example into 
three levels unary-ternary, linear-quadratic, cubic-veryslow. 

Another model for priorities in constraint propagation based on composition 
operators is |14j . The model, however, runs all propagators of lower priority 
before switching propagation back to propagators of higher priority. This model 
does not preempt computing a fixpoint for a low priority. The model always 
completes a fixpoint for a given priority level and only then possibly continues 
at a higher priority level. 

Most systems have some form of static priorities, typically using two priority 
levels (for example, SICStus [18], Mozart [26]). The two levels are often not 
entirely based on cost: in SICStus all indexicals have high priority and all other 
lower priority. While ECL 4 PS e [36] [15] supports twelve priority levels, its finite 
domain solver also uses only two priority levels where another level is used to 
support constraint debugging. 

A similar, but more powerful approach is used by Choco [19] using seven 
priority levels allowing both LIFO and FIFO traversal. 

Prioritizing particular operations during constraint propagation is important 
in general. For interval narrowing, prioritizing constraints can avoid slow 
convergence, see for example [2D] . 

The prioritizing of propagators by cost is important, inverting the priorities 
can lead to significant disadvantages. 

Example 6.3 [Inverted priorities] Consider executing Example 16.21 with in- 
verted priorities. We first execute Jm then Jm for no effect. Then executing fc 
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Table 8: Priority experiments with varying granularities. 



HVvSI TYlTll 
1 t A fli 1 1 1 L 1 1 1 . 


cm all 

time steps 


TT1 t^fl 111 TY1 

time steps 


full 

time steps 


all-interval-500 


+ 1.5/0 +U.O/0 


-\-z.6/o +O.0/0 


i o q 0/ no'"' 
+2.0/0 +0.3% 


alpha 


+ 1.2/0 ±O.U7o 


+ 1.270 ±0.070 


i 1 no/ i r\ no/ 

+ 1.2/0 ±0.0/o 


bibd-7-3-60 


+0.17o + 16.97o 


+0.l7o +16.970 


+0.270 +16.97o 


cars 


— 1.47o — 10.47o 


1 CO? i n CO/ 

— 1.5/0 —10.5% 


i oOy i n t 07 

— 1.070 — 10.57o 


crowded-chess-7 


+S.17o +91. 27c 


+9.0% +91.270 


i o oO/ i r\i i 07 

+8.87o +91.1% 


donald-b 


i i i 0/ _i_n ni ' ' 

+ 1.170 ±0.07o 


in i ' ■ ^n no/ 

+0.470 ±U.O/o 


i n no/ i e\ no/ 

+0.97o ±0.07o 


donald-d 


— O.lTo ±0.07o 


r cOZ i r n0/ 

—5.6% +5.0/o 


rr cr07 i c no/ 

— 5.57o +5.07o 


donald-v 


i pC7 _i_n no/ 
+ 1.D70 ±0.07o 


+2.6% ±0.0/0 


i o no/ _i_n mv 

+2.27o ±0.07o 


golomb-10-b 


on i 07 i no r 07" 
— OJ.1% +00.070 


ni oOy i i r\07 

—61.O70 +51.070 


ni r- 07 i r-i r\ 07 

— ol.DTO +51.070 


golomb— 10 — d 


AQ f^oy i 90 907 


— ^y.u/o -p^to. / /o 


A9. Q% -L-AP, 7% 

— ^o.y/o -h^u. i /o 


gr aph— color 


-1.3% ±0.0% 


-1.3% -0.1% 


-0.9% -0.1% 


grocery 


+10.3% -9.2% 


-5.6% +1.9% 


-5.6% +1.9% 


knights-10 


-3.4% -19.5% 


-4.7% -19.9% 


-3.7% -19.9% 


minsort-200 


+0.5% ±0.0% 


+0.5% ±0.0% 


-0.2% ±0.0% 


o-latin-7-d 


-10.6% +4.6% 


-11.0% +4.6% 


-10.6% +4.6% 


partition-32 


-38.6% -18.2% 


-38.7% -17.8% 


-38.4% -17.8% 


photo 


+0.1% +3.4% 


+0.2% +3.4% 


-7.3% -1.8% 


picture 


-1.0% ±0.0% 


-1.5% ±0.0% 


+0.2% ±0.0% 


queens-400 


+0.7% ±0.0% 


+0.5% ±0.0% 


+0.5% ±0.0% 


queens-400-a 


+0.2% ±0.0% 


+0.2% ±0.0% 


+0.3% ±0.0% 


sequence-500 


+0.5% ±0.0% 


+0.3% ±0.0% 


+0.6% ±0.0% 


square-5-d 


-0.5% ±0.0% 


+0.9% +23.8% 


+1.1% +23.8% 


square-7-b 


+ 1.3% ±0.0% 


+0.4% ±0.0% 


-23.4% +22.0% 


square-7-v 


+0.9% ±0.0% 


+0.7% ±0.0% 


+0.9% ±0.0% 


warehouse 


+5.3% +13.0% 


+5.7% +12.0% 


+5.3% +12.0% 


average (all) 


-5.6% +3.8% 


-6.3% +6.4% 


-7.4% +7.0% 



modifies D(x±), so each of /jv and /m are enqued and re-executed for no effect. 
Then executing fo has the same behavior. Overall we execute the propagators 
/m and /at each at least 10 times, as opposed to 3 and 2 times respectively in 
Example 16.21 Since they are the most expensive to execute we would expect 
this to be slower (this is confirmed immediately below). □ 

6.3 Static Priority Experiments 

Priority granularity Table [8] gives runtime and propagation steps of various 
priority granularities compared to a propagation engine using a FIFO queue (and 
using dynamic fixpoint reasoning, events of types {fix, be, dmc}, and fully dy- 
namic event sets) . The three different experiment capture different priority gran- 
ularities: "small" uses three priorities (unary-ternary, linear-quadratic, 
cubic-veryslow); "medium" uses seven priority levels (from unary to veryslow); 
"full" uses 14 priority levels (from unary-high to very slow-low). 

The results illustrate that even when there are substantially more propaga- 
tions (for example, golomb-10-{b,d}) there can be significant savings. Priori- 
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ties can have a very substantial saving (almost 50% for golomb-10-d) and the 
worst case cost on the benchmarks is only 10.3%. Overall while the "medium" 
range of priorities is preferable on many benchmarks, the "full" range of priori- 
ties gives real speedups on two more benchmarks (photo and square-7-b) and 
tends to reduce the worst case behavior of "medium" . 

Examples such as queens-400, queens-400-a, and sequence-500 for "small" 
only feature propagators with the same priority (the number of propagation 
steps remains the same). Hence the increase in runtime by less than 1% de- 
scribes the overhead of using priorities at all. 

Depending on the underlying system, increasing the granularity also requires 
more memory. Gecode, as a system based on recomputation and copying, con- 
stitutes the worst case in that each copied node in the search tree maintains 
queues for all priority levels. However, the average increase in used (not allo- 
cated as before) memory is +0.2% for "small", +0.7% for "medium" , and +1.5% 
for "full" and hence can be neglected. 

A broad spectrum of priorities will be useful for the optimizations presented 
in Section [7J Therefore it is important that while "full" does not offer huge 
advantages over "medium" , it neither degrades overall performance nor requires 
much memory. 

Priorities and stacks Table [5] gives the runtime and propagation steps of 
using priorities together with stacks or combinations of stacks and queues for 
the different priority levels. All numbers are given relative to a propagation 
engine using the "full" priority spectrum with only queues for each priority 
level. All propagation engines considered also use the full priority spectrum. The 
propagation engine for "for all" uses only stacks for all priority levels, whereas 
"for 1,2,3-ary" ("for 1,2-ary") uses stacks for priority levels unary-high to 
ternary-low (unary-high to binary-low) and queues for the other levels. 

The numbers for "for all" clarify that the misbehavior of LIFO stacks is 
not due to priority inversion. The folklore belief that LIFO stacks are good 
for small propagators is refuted by the numbers for "for 1,2,3-ary" and "for 
1, 2-ary". Only three examples show improvement in both cases while grocery 
"for 1, 2, 3-ary" already exhibits pathological behavior. The measurements show 
that queue versus stack does not matter for unary or binary constraints whereas 
stacks are wrong for anything else. 

Issues to avoid Table [10] shows runtime and propagation steps of using pri- 
orities in flawed ways. The experiment "complete fixpoints" refers to the model 
proposed in |14j where fixpoints are always completed before possibly switching 
to a higher priority level. As to be expected, the number of propagation steps 
is reduced, however at the expense of increased runtime. More importantly, 
two examples exhibiting substantial slowdown (golomb-10-d and square-5-d) 
are particularly relevant as they feature propagators of vastly different priority 
levels. 

The surprising behavior of bibd-7-3-60 appears to be due to a problem 
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Table 9: Priority experiments with stacks and priorities. 



Example 


for all 
time steps 


for 1,2,3-ary 
time steps 


for 1 , 2-ary 
time steps 


all-interval-500 


-0.6% ±0.0% 


+1.3% ±0.0% 


±0.2% ±0.0% 


alpha 


+47.2% +99.0% 


±2.3% ±0.0% 


±1.0% ±0.0% 


bibd-7-3-60 


+5.5% +49.7% 


-4.0% ±1.4% 


-5.9% ±1.4% 


cars 


-4.2% -7.3% 


±1.2% -2.0% 


±1.8% -2.0% 


crowded-chess-7 


+0.6% +10.4% 


±2.0% ±0.0% 


±1.7% ±0.0% 


donald-b 


+0.3% ±0.0% 


±1.6% ±0.0% 


±0.9% ±0.0% 


donald-d 


-0.6% ±2.4% 


±1.0% ±0.0% 


±0.3% ±0.0% 


donald-v 


±7.7% ±23.2% 


±2.5% ±0.0% 


±1.7% ±0.0% 


golomb-10-b 


±7.1% ±27.0% 


±9.7% ±27.0% 


±1.2% ±0.0% 


golomb-10-d 


±5.7% ±39.2% 


±7.7% ±39.2% 


±1.0% ±0.0% 


graph-color 


±62.7% ±77.1% 


±2.3% ±0.0% 


±1.5% ±0.0% 


grocery 


±42.8% ±7.1% 


±45.8% ±7.1% 


±1.6% -0.3% 


knights-10 


±0.1% ±5.0% 


±2.1% +5.0% 


±1.4% +5.0% 


minsort-200 


+1545.4% +1106.4% 


+2.2% ±0.0% 


+2.4% ±0.0% 


o-latln-7-d 


-3.3% +5.2% 


±1.0% ±0.3% 


-0.4% +0.3% 


partition-32 


-2.2% -2.9% 


±2.2% +2.3% 


+2.2% +0.4% 


photo 


-1.3% -0.8% 


-1.0% -0.9% 


+0.1% -0.9% 


picture 


+21.1% +79.9% 


-1.3% ±0.0% 


-1.2% ±0.0% 


queens-400 


-0.1% -0.1% 


±1.0% -0.1% 


±0.8% -0.1% 


queens-400-a 


-1.0% -0.3% 


±1.2% ±0.0% 


±1.2% ±0.0% 


sequence-500 


+54.1% +240.9% 


±1.4% ±0.0% 


±0.8% ±0.0% 


square-5-d 


+65.2% +31.6% 


±1.2% ±0.0% 


±0.8% ±0.0% 


square-7-b 


+1.7% +18.6% 


±1.4% ±0.0% 


±0.8% ±0.0% 


square-7-v 


+8.9% +35.2% 


±2.9% ±0.0% 


±2.6% ±0.0% 


warehouse 


-2.5% +1.7% 


±3.0% ±0.2% 


±3.0% ±0.2% 


average (all) 


+23.9% +36.0% 


±3.3% ±2.8% 


±0.8% ±0.2% 
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Table 10: Priority experiments: issues to avoid. 



Example 


complete 


fixpoints 


inverse 


priorities 




time 


steps 


time 


steps 


all-interval-500 


-0.8% 


±0.0% 


±13296.5% 


+32.1% 


alpha 


+0.5% 


±0.0% 


+0.3% 


±0.0% 


bibd-7-3-60 


-11.7% 


-45.8% 


-7.7% 


-42.8% 


cars 


-1.2% 


-0.9% 


+14.4% 


+40.1% 


crowded-chess-7 


-4.2% 


-41.1% 


-7.4% 


-52.0% 


donald-b 


+0.9% 


±0.0% 


+36.8% 


+31.9% 


donald-d 


+0.9% 


±0.0% 


+6.2% 


-7.1% 


donald-v 


+1.6% 


±0.0% 


+0.2% 


±0.0% 


golomb-10-b 


-2.5% 


-14.5% 


+892.0% 


-5.7% 


golomb-10-d 


+14.6% 


-14.2% 


+945.4% 


-2.2% 


graph-color 


+1.9% 


+0.1% 


+336.8% 


+51.1% 


grocery 


+29.3% 


-23.4% 


+35.3% 


+9.1% 


knights-10 


-0.9% 


±0.0% 


+8.7% 


+43.5% 


mlnsort-200 


+5.5% 


±8.6% 


+2732.2% 


+2246.7% 


o-latin-7-d 


+7.3% 


-3.8% 


+105.1% 


+18.7% 


partition-32 


+0.4% 


-1.2% 


+108.3% 


+66.8% 


photo 


+1.2% 


±0.0% 


+18.6% 


-1.9% 


picture 


-3.6% 


±0.0% 


-3.7% 


±0.0% 


queens-400 


-0.7% 


±0.0% 


+0.2% 


±0.0% 


queens-400-a 


+0.3% 


±0.0% 


-0.3% 


±0.0% 


sequence-500 


+0.2% 


±0.0% 


-0.2% 


±0.0% 


square-5-d 


+32.5% 


±7.9% 


+50.1% 


±7.5% 


square-7-b 


+0.5% 


±0.0% 


+177.4% 


-7.6% 


square-7-v 


+2.6% 


±0.0% 


±0.0% 


±0.0% 


warehouse 


-3.7% 


-9.0% 


±14.7% 


+11.3% 


average (all) 


+2.5% 


-6.6% 


±107.5% 


+18.4% 
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with how the priorities for the two different kinds of propagators used in this 
example (lex and Boolean-sum) are classified. This clarifies that even with a 
rich spectrum of priority levels at disposal it remains difficult to assign priority 
levels to propagators. 

The experiment "inverse priorities" shows numbers for a propagation engine 
where a propagator with lowest priority is executed first. The experiment shows 
that there is a point to the priority levels, high priority = fast propagator. While 
the number of propagations is often reduced the approach is rarely better than 
no priorities and sometimes catastrophically worse. 

The experiment "inverse priorities" clarifies a very important aspect of pri- 
orities: they not only serve as a means to improve performance, they also serve 
as a safeguard against pathological propagation order. 

6.4 Dynamic Priorities 

As evaluation proceeds, variables become fixed and propagators can be replaced 
by more specialized versions. If a propagator is replaced by a more specialized 
version, also its priority should change. 

Example 6.4 [Updating a propagator] Consider the propagator fo for up- 
dating x\ in the constraint x\ = X2 + £3 defined by 

fo{D)(xi) = D(x\) n [inf ,0(2:2) + info (13) ■■ sup D (x 2 ) + sup D (x 3 )} 
fo(D){x) = D(x) x + x x 

might have initial priority ternary. When the variable xi becomes fixed to di 
say, then the implementation for x\ can change to 

fo(D)(x 1 ) = D(xx) n [d 2 + inf D (x 3 ) •• d 2 + sup D (x 3 )} 

and the priority can change to binary. □ 

Changing priorities is also relevant when a propagator with n > 3 variables 
with priority linear (or worse) reduces to a binary or ternary propagator. 

6.5 Dynamic Priority Experiments 

Table [TT] shows runtime and propagation steps for an engine using dynamic 
priorities compared to an engine using all optimizations introduced earlier and 
the full priority spectrum. Dynamically changing the priority of propagators as 
they become smaller due to fixed variables can lead to significant improvements. 
In effect, constraints that become smaller (and thus run at higher priority) are 
run first causing the still large constraints to be run less often. 

This is in particular true for alpha with initially only propagators for lin- 
ear equalities with priority linear. When fixing variables during search many 
of these propagators are then run at priority levels binary and ternary. It 
is worth noting that using dynamic priorities can disturb the FIFO queue be- 
havior: for sequence-500 it appears to be more important to run all exactly 
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Table 11: Dynamic priority experiments. 



Example 


dynamic 




time steps 


alpha 


-25.5% -41.7% 


cars 


-5.9% -13.4% 


crowded-chess-7 


-5.0% -26.5% 


o-latin-7-d 


-7.3% -7.4% 


picture 


-1.9% ±0.0% 


sequence-500 


+6.0% +34.8% 


average (above) 


-7.1% -12.1% 


average (all) 


-1.7% -3.0% 



propagators at the same priority level. Running some of the exactly propaga- 
tors at priorities binary and ternary is not beneficial and disturbs the queue 
behavior. 

Dynamic priorities incur the overhead to compute the priority based on the 
number of not yet fixed variables. However, the overhead is still small enough 
to make dynamic priorities worthwhile overall. 

7 Combining Propagation 

There are many ways to define a correct propagator / for a single constraint 
c: the art of building propagators is to find good tradeoffs in terms of speed 
of execution versus strength of propagation. Typically a single constraint may 
have a number of different propagator implementations: the cheapest simple 
propagator, a more complex bounds propagator, and a more complex domain 
propagator, for example. 

Example 7.1 [alldif f erent propagators] Consider the propagator fp(D) 
for the alldif f erent constraint. 

E := 

for i G [1 .. n] 

if (3d.D(x t ) = {d}) 

if (d e E) return D ± else E := E U {d} 
for i G [1 .. n] 

if (\D( Xi )\ > 1) Dfa) := D{ Xi )-E 
return D 

The propagator does a linear number of set operations in each invocation and is 
checking. It can be made idempotent by testing that no variable becomes fixed. 

Another propagator for the same constraint is the domain propagator /jv 
introduced in [29 with complexity 0(n 2 - 5 ). □ 
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Given two propagators, say fx and fa in prop(c), where fx is strictly stronger 
than /2 (/i(-D) C /2(D) for all domains D), we could choose to implement c by 
just /1 or just /2 trading off pruning versus execution time. 

Without priorities there is no point in implementing the constraint c using 
both propagators, since fx will always be run and always compute stronger 
domains than /2. 

We could possibly merge the implementation of the propagators to create 
a new propagator /12(D) = /i(/2 (Dj). By running the cheaper propagator 
immediately first we hope that we can (a) quickly determine failure in some cases 
and (b) simplify the domains before applying the more complicated propagator 
fx- While this immediate combination of two propagators in essence is simply 
building a new propagator, once we have priorities in our propagation engine 
we can use two or more propagators for the same constraint in different ways. 

7.1 Multiple Propagators 

Once we have priorities it makes sense to use multiple propagators to implement 
the same constraint. We can run the weaker (and presumably faster) propagator 
/2 with a higher priority than fx- This makes information available earlier to 
other propagators. When the stronger propagator fx is eventually run, it is able 
to take advantage from propagation provided by other cheaper propagators. 

Note that this is essentially different from having a single propagator /i 2 
that always first runs the algorithm of fi and then the algorithm of fx . 

Example 7.2 [Multiple alldif f erent] Consider the two propagators fp and 
Jn defined in Example [7J] above. We can use both propagators: fp with priority 
linear, and /at with priority quadratic. This means that we will not invoke 
/at until we have reached a fixpoint of fp and all linear and higher priority 
propagators. 

Consider the additional propagator for the constraint 3a; 1 = 2x2, which 
has priority binary. Consider the domain D where D(x\) = {4,6}, -D(x 2 ) = 
{6,9}, D(x 3 ) = {6,7} and D{x±) = ■ ■ ■ = D(x n ) = [1 .. n], which is a fixpoint 
for /b, fp and /at. Now assume the domain of X3 is reduced to 6. Propagator 
fp is placed in queue linear and /jy is placed in queue quadratic. Applying 
fp removes 6 from the domain of all the domains of xx, X2, xx, . . . , i„, and this 
causes to be placed in queue binary. This is the next propagator considered 
and it causes failure. Propagator $n is never executed. 

If we just use /jv then we need to invoke the more expensive /at to obtain 
the same domain changes as fp, and then fail. □ 

7.2 Staged Propagators 

Once we are willing to use multiple propagators for a single constraint it becomes 
worth considering how to more efficiently manage them. Instead of using two 
(or more) distinct propagators we can combine the several propagators into a 
single propagator with more effective behavior. 
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We assume that a propagator has an internal state variable, called its stage. 
When it is invoked, the stage determines what form of propagation applies. 

Example 7.3 [Staged alldif f erent] Consider the alldif f erent constraint 
with implementations fp and /at discussed in Example 17. 11 We combine them 
into a staged propagator as follows: 

• On a fix(a;) event, the propagator is moved to stage A, and placed in the 
queue with priority linear. 

• On a dmc(i) event, unless the propagator is in stage A already, the prop- 
agator is put in stage B, and placed in the queue with priority quadratic. 

• Execution in stage A uses fp, the propagator is put in stage B, and placed 
in the queue with priority quadratic, unless it is subsumed. 

• Execution in stage B uses /jv, afterwards the propagator is removed from 
all queues (stage NONE). 

The behavior of the staged propagator is identical to the multiple propa- 
gators for the sample execution of Example 17.11 In addition to the obvious 
advantage of having a single staged propagator, another advantage comes from 
avoiding the execution of /at when the constraint is subsumed. □ 

In addition to giving other propagators with higher priority the opportunity 
to run before the expensive part of a staged propagator, the first stage of a 
propagator can already determine that the next second does not need to be 
run. This is illustrated by the following example. 

Example 7.4 [Staged linear equations] Consider the unit coefficient linear 
equation Y,™ =1 aiXi = d constraint where |<2j| = 1, 1 < i < n. We have two im- 
plementations, /q, which implements bounds(R) consistency (considering real 
solutions, with linear complexity) for the constraint, and fp, which implements 
domain consistency (with exponential complexity). 

We combine them into a staged propagator as follows: 

• On a bc(x) (or depending on the event types available: lbc(x) or ubc(a;)) 
event, the propagator is moved to stage A, and placed in the queue with 
priority linear. 

• On a dmc(s) event, unless the propagator is in stage A already, the propa- 
gator is put in stage B, and is placed in the queue with priority veryslow. 

• Execution in stage A uses /q, afterwards the propagator is put in stage B, 
and placed in the queue with priority veryslow, unless each Xi has a range 
domain in which case it is removed from all queues (stage NONE). 

• Execution in stage B uses fp, afterwards the propagator is removed from 
all queues (stage NONE). 
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Table 12: Combination experiments. 



Example 


immediate 
time steps 


multiple 

time steps 


staged 
time steps 


donald-b 


+0.7% ±0.0% 


-15.4% ±10.4% 


-16.9% +10.1% 


donald-d 


-1.6% -2.4% 


±0.0% +111.9% 


-1.6% +88.1% 


golomb-10-b 


+0.3% ±0.0% 


-10.1% +9.2% 


-9.8% +9.2% 


golomb-10-d 


-2.7% ±0.2% 


-14.5% +8.6% 


-16.1% +8.6% 


graph-color 


-0.6% -0.6% 


+2.0% +23.0% 


-14.4% +12.3% 


o-latin-7-d 


-1.2% +1.0% 


-9.6% +14.3% 


-14.0% +10.9% 


partition-32 


-0.6% ±0.0% 


-4.3% +7.4% 


-6.2% +4.2% 


photo 


±0.0% ±0.0% 


-0.9% +7.8% 


-1.5% +7.8% 


picture 


+0.3% ±0.0% 


-0.1% ±0.0% 


-3.4% ±0.0% 


square-5-d 


-29.6% -7.4% 


-38.3% ±28.8% 


-42.8% +56.0% 


square-7-b 


-0.1% ±0.0% 


-17.5% -11.0% 


-18.6% -10.9% 


average (above) 


-3.6% -0.9% 


-10.7% ±16.1% 


-14.0% +15.3% 


average (all) 


-1.6% -0.4% 


-4.8% +6.8% 


-6.5% +6.5% 



The staged propagator is advantageous since the "fast" propagator /q can 
more often determine that its result D' = /q(D) is also a fixpoint for □ 

Staged propagators are widely applicable. They can be used similarly for the 
bounds(Z) version of the alldif f erent constraint. Another area where staged 
propagators can be used is constraint-based scheduling, where typically different 
propagation methods with different strength and efficiency are available [2] . 

Staging is not limited to expensive propagators, it is already useful for binary 
(for example, combining bounds and domain propagation for the absolute value 
constraint abs(a;) = y) and ternary constraints (for example, combining bounds 
and domain propagation for the multiplication constraint x x y = z). 

It is important to note that staging requires a sufficiently rich spectrum of 
priorities. For example, to use staging for binary or ternary propagators as men- 
tioned above, at least two different priority levels must be available for staging. 
This explains why the full priority spectrum is useful: here, for binary prop- 
agators two priorities binary-high and binary-low are available. Likewise, 
ternary-high and ternary-low are available for ternary propagators. 

7.3 Combining Propagation Experiments 

Table [T3 presents runtime and propagation steps of different propagator combi- 
nation schemes compared to a propagation engine using the full priority spec- 
trum and all optimizations presented so far. The experiment "immediate" uses 
a single propagator that always runs the first stage immediately followed by 
the second stage. For experiment "multiple" , multiple propagators for different 
stages (as discussed in Section f7TT|) are used, whereas for experiment "staged" 
full staging is used (as described in Section [72]) . 

A quite surprising result is that "immediate" offers only modest or even 
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no speedup. The only exception is square-5-d using bounds(Kl) propagation 
immediately before domain propagation for several linear equation propagators. 

Using multiple propagators leads to an average reduction in runtime by 10% 
with a slowdown for just a single example (graph-color). As to be expected, 
the number of propagation steps rises sharply. This is due to the fact that more 
propagators need to be run and that, similar to the introduction of priorities, 
propagators with high priority are run more often. The exceptional case of 
square-7-b where the number of propagation steps decreases appears to be a 
fortunate change in propagation order. 

Staged propagation offers another level of improvement over using multiple 
propagators: all examples now achieve speedup. As fewer propagators must 
be executed compared to "multiple" , also the number of propagation steps de- 
creases (apart from square-5-d being another case of changing propagation 
order). Due to the reduced overhead compared to "multiple" , even small exam- 
ples such as graph-color are able to benefit from staged execution. 

The exact improvement in runtime of donald-d for "immediate" and "staged" 
is due to early fixpoint detection for a big linear equation propagator as dis- 
cussed in Example 17.41 

The memory requirements for "immediate" and "staged" are unchanged. 
The use of multiple propagators for "multiple" leads to an average increase of 
6.4% in allocated memory for the examples shown in Table [T21 

In summary, staged propagation is very effective for all examples, so it should 
clearly be used. 

8 Experiment Summary 

In Table[Tn]we summarize the effect of all improvements suggested in this paper. 
The naive propagation engine is compared to an engine featuring all techniques 
introduced in this paper: dynamic fixpoint reasoning, {dmc, fix, be} fully dy- 
namic events, dynamic priority based LIFO queuing with the full priority spec- 
trum, and staged propagators. 

It is interesting to note that all examples but warehouse show an improve- 
ment in runtime and that almost 75% of the examples show an improvement 
of at least 10%. The improvement in runtime does not incur a large increase 
in memory: the largest increases arc for the three donald-* problems, where 
the increase is actually negligible in absolute terms and due to the underlying 
memory allocation strategy (as discussed in Section . 

The effects of the individual optimizations discussed in this paper could be 
summarized as follows. Dynamic fixpoint reasoning subsumes static reasoning 
and is easy to implement, it provides a modest improvement in execution times. 
Events, while used in all finite domain propagation engines, have less benefit 
than perhaps was assumed by developers. Using dynamic events again leads to a 
modest improvement in execution times. The fairness of a FIFO queue strategy 
is essential for scheduling propagators. While priorities by themselves are not 
that important they provide a protection against worst case behavior and enable 
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Tabic 13: Experiment summary. 



Example 


no optimizations 


all optimizations 




time (ms) 


mem (KB) 


time 


memory 


all-interval-500 


118.31 


385.4 


-36.1% 


+25.0% 


alpha 


106.56 


22.2 


-31.2% 


+9.8% 


bibd-7-3-60 


2 279.36 


6 688.6 


-11.0% 


+5.8% 


cars 


4.64 


41.7 


-7.7% 


+1.2% 


crowded-chess-7 


624.25 


196.9 


-8.1% 


-6.4% 


donald-b 


0.70 


7.4 


-25.7% 


+19.0% 


donald-d 


30.37 


3.2 


-13.0% 


+68.2% 


donald-v 


0.38 


5.4 


-3.3% 


+44.4% 


golomb-10-b 


1347.48 


40.0 


-38.5% 


+9.3% 


golomb-10-d 


2 430.00 


37.0 


-43.7% 


+4.8% 


graph-color 


35.87 


832.4 


-19.9% 


+9.0% 


grocery 


55.41 


7.7 


-5.9% 


+7.9% 


knlghts-10 


7.46 


770.3 


-14.7% 


+4.2% 


minsort-200 


342.48 


32 454.5 


-58.8% 


-5.1% 


o-latin-7-d 


574.36 


242.9 


-32.5% 


+5.5% 


partition-32 


8 571.24 


160.4 


-40.4% 


+9.2% 


photo 


108.87 


37.0 


-23.8% 


+10.1% 


picture 


1553.42 


450.5 


-3.0% 


+18.0% 


queens-400 


4433.12 


30 286.1 


-87.6% 


+0.2% 


queens-400-a 


16.15 


566.3 


-10.3% 


+4.4% 


sequence-500 


517.96 


6 081.5 


-79.8% 


-49.4% 


square-5-d 


33 391.24 


43.5 


-44.2% 


+11.9% 


square-7-b 


10166.24 


160.3 


-41.2% 


+8.7% 


square-7-v 


5 690.00 


144.2 


-3.9% 


+11.7% 


warehouse 


0.74 


29.4 


+1.2% 


+1.0% 


average (all) 






-33.3% 


+7.2% 
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the use of multiple propagators. Staging is an important optimization that can 
significantly improve performance. 

9 Conclusion and Future Work 

We have given a formal definition of propagation systems including idcmpotence, 
events, and priorities used in current propagation systems and have evaluated 
their impact. We have introduced dynamically changing event sets which are 
shown to improve efficiency considerably. The paper has introduced multiple 
and staged propagators which are shown to be an important optimization in 
particular for improving the efficiency of costly global constraints. 

While the improvements to an engine of a propagation based constraint 
solver have been discussed for integer constraints, the techniques readily carry 
over to arbitrary constraint domains such as finite sets and multisets. 

A rather obvious way to further speed up constraint propagation is to con- 
sider not only cost but also estimated impact for a propagator. However, while 
computing cost is straightforward it is currently not clear to us how to accurately 
predict propagation impact. 

A Examples Used in Experiments 

All variants of constraint propagation discussed in the paper are experimentally 
evaluated. The characteristics of the examples used in evaluation are summa- 
rized in Table 1141 The column "variables" gives the number of variables in the 
example, whereas the column "propagators" shows the number of propagators 
as implementations of constraints in the example. The column "search" shows 
which search strategy is used to search for a solution ("first" is simple back- 
tracking search for the first solution, "all" is search for all solutions, "best" is 
branch- and-bound search for a best solution). The two last columns describe 
how many failed nodes are explored during search (column "failures" ) and how 
many solutions are found (column "solutions" ) . 

A -d at the end of the example name means that domain propagation is used 
for all occurring alldif f erent and linear equation constraints. Likewise, -b 
means that bounds(Z) propagation is used for all alldif f erent and bounds(M) 
for all linear equation constraints. In contrast, for -v bounds(K) propagation is 
used for all linear constraints, whereas naive propagation (eliminating assigned 
values as in Example 17. ip is used for alldif f erent. 

If not otherwise mentioned, bounds consistency is used for arithmetic con- 
straints (including linear constraints) and naive propagation for alldif f erent. 

• all-interval-500 computes a series of numbers where the distances be- 
tween adjacent numbers are pairwise distinct (prob007 in |10j). The model 
uses a single bounds(Z) consistent alldifferent propagator and many 
binary absolute value (abs(.x) = y) and ternary minus propagators. 
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Table 14: Example characteristics. 



Example 


variables 


propagators 


search 


failures 


solutions 


all-interval-500 


1498 


1002 


first 





1 


alpha 


26 


21 


all 


7435 


1 


bibd-7-3-60 


11760 


9 693 


first 


1306 


1 


cars 


60 


93 


all 


107 


6 


crowded-chess-7 


163 


275 


first 


30 396 


1 


donald-b 


10 


2 


first 


79 


1 


donald-d 


10 


2 


first 


5 


1 


donald-v 


10 


2 


first 


79 


1 


golomb-10-b 


46 


46 


best 


19 929 


10 


golomb-10-d 


46 


46 


best 


19 929 


10 


graph-color 


201 


566 


first 


37 


1 


grocery 


7 


7 


first 


37 


1 


knights-10 


2 028 


2 981 


first 


2 


1 


minsort-200 


399 


398 


first 





1 


o-latin-7-d 


147 


133 


first 


2188 


1 


partition-32 


128 


134 


first 


160 258 


1 


photo 


61 


54 


best 


6 995 


7 


picture 


625 


50 


first 


3 242 


1 


queens-400 


400 


239 400 


first 


10 


1 


queens-400-a 


400 


3 


first 


10 


1 


sequence-500 


500 


502 


all 


250 


1 


square-5-d 


25 


15 


first 


41272 


1 


square-7-b 


49 


19 


first 


245 208 


1 


square-7-v 


49 


19 


first 


481301 


1 


warehouse 


81 


76 


best 


20 


4 



40 



alpha and donald are crypto-arithmetic puzzles involving linear equation 
propagators and a single alldif f erent propagator. 

bibd-7-3-60 is an instance of a balanced incomplete block design problem 
with parameters (v, k, I) = (7, 3, 60) (prob028 in [TU])- The model involves 
Boolean-sum propagators and lex propagators for symmetry breaking. 

cars models the well known car sequencing problem from [35] using element, 
exactly, and linear equation propagators (probOOl in |10jV 

crowded-chess-7 places several different chess pieces on a 7 x 7 chess- 
board [1 1| . It uses exactly, element, domain consistent alldif f erent, 
and bounds (K.) consistent linear equation propagators. 

golomb-10 finds an optimal Golomb ruler of size 10 (prob006 in [TUj) with 
the usual model. 

graph-color performs clique-based graph coloring for a graph with 200 
nodes. Coloring each clique uses a domain consistent alldif f erent prop- 
agator. 

grocery is a small crypto-arithmetic puzzle using in particular bounds(K) 
consistent multiplication propagators. 

knights-10 finds a sequence of knight moves on a 10 x 10 chess board such 
that each field is visited exactly once and that the moves return the knight 
to the starting field. The model uses a naive alldif f erent propagator 
and a large number of reified binary propagators. 

minsort-200 sorts 200 variables using 200 minimum propagators. 

o-latin-7 finds an orthogonal latin square of size 7 and mostly uses 
domain consistent alldif f erent propagators. 

partition-32 partitions two 32 number blocks such that their products 
match. Uses several bounds(M) multiplication propagators, a single do- 
main consistent alldif f erent propagator, and few linear equation prop- 
agators. 

photo places 9 persons on a picture such that as many preferences as 
possible are satisfied. Uses a large bounds(Z) consistent alldifferent 
propagator, a large bounds(R) consistent linear propagator, and many 
reified binary propagators. 

picture models a 25x25 picture-puzzle (prob012 in [10] ) using 50 regular 
propagators. 

queens-400 and queens-400-a places 400 queens on a 400 x 400 chess 
board such that the queens do not attack each other, queens-400 uses 
quadratically many binary disequality propagators, while queens-400-a 
uses three naive alldif ferent-propagators. 
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• sequence-500 computes a magic sequence with 500 elements using 500 
exactly propagators (prob019 in [10]L 

• square-5 (square-7) computes a magic square of size 5 x 5 (7 x 7) us- 
ing linear equation propagators and a single alldif f erent propagator 
(prob019 in [TO]). 

• warehouse solves a warehouse location problem following [32] . 

B Evaluation Platform 

All experiments use Gecode, a C++-based constraint programming library [12] . 
Gecode is one of the fastest constraint programming systems currently available, 
benchmarks comparing Gecode to other systems are available from Gecode's 
webpage. The version used in this paper corresponds to Gecode 1.3.0 (albeit 
slightly modified to ease the numerous experiments in this paper). Gecode has 
been compiled with Microsoft Visual Studio Express Edition 2005. 

All examples have been run on a Laptop with a 2 GHz Pentium M CPU 
and 1024 MB main memory running Windows XP. Runtimes are the average of 
25 runs with a coefficient of deviation less than 4% for all benchmarks. 

Acknowledgments 

Christian Schulte is partially funded by the Swedish Research Council (VR) 
under grant 621-2004-4953. We thank Mikael Lagerkvist and Guido Tack for 
many helpful suggestions that improved the paper. 

References 

[1] Krzysztof Apt. Principles of Constraint Programming. Cambridge Univer- 
sity Press, Cambridge, United Kingdom, 2003. 

[2] Philippe Baptiste, Claude Le Pape, and Wim Nuijten. Constraint-based 
Scheduling. Kluwer Academic Publishers, Dordrecht, The Netherlands, 
2001. 

[3] Nicolas Beldiceanu, Warwick Harvey, Martin Henz, Frangois Laburthe, Eric 
Monfroy, Tobias Muller, Laurent Perron, and Christian Schulte. Proceed- 
ings of TRICS: Techniques foR Implementing Constraint programming Sys- 
tems, a post-conference workshop of CP 2000. Technical Report TRA9/00, 
School of Computing, National University of Singapore, September 2000. 

[4] Frederic Benhamou. Heterogeneous Constraint Solving. In Proceedings of 
the Fifth International Conference on Algebraic and Logic Programming, 
volume 1139 of LNCS, pages 62-76, Aachen, Germany, 1996. Springer- 
Verlag. 



42 



[5] Mats Carlsson and Nicolas Beldiceanu. Revisiting the lexicographic order- 
ing constraint. Technical Report T2002-17, Swedish Institute of Computer 
Science, Stockholm, Sweden, 2002. 

[6] Mats Carlsson, Greger Ottosson, and Bjorn Carlson. An open-ended finite 
domain constraint solver. In Programming Languages: Implementations, 
Logics, and Programs, 9th International Symposium, PLILP'97, volume 
1292 of LNCS, pages 191-206, Southampton, United Kingdom, September 
1997. Springer- Verlag. 

[7] Andre Chamard, Annie Fischler, Dominiquc-Benoit Guinaudeau, and An- 
dre Guillard. CHIC lessons on CLP methodology. Technical report, Das- 
sault Aviation, 1995. 

[8] Chiu Wo Choi, Warwick Harvey, Jimmy Ho-Man Lee, and Peter J. 
Stuckey. Finite domain bounds consistency revisited. Technical report, 
http : //arxiv . org/abs/cs . AI/0412021, 2004. 

[9] Philippe Codognet and Daniel Diaz. Compiling constraints in clp(FD). 
Journal of Logic Programming, 27(3):185-226, June 1996. 

[10] CSPLib. CSPLib: a problem library for constraints, 2006. Available from 
http : //www. csplib. org. 

[11] Henry E. Dudeney. Amusements in Mathematics. Dover, New York, NY, 
USA, 1958. 

[12] Gecode Team. Gecode: Generic constraint development environment, 2006. 
Available from http : //www . gecode . org. 

[13] Ian P. Gent, Chris Jefferson, and Ian Miguel. Watched literals for constraint 
propagation in Minion. In Frederic Benhamou, editor, Twelfth Interna- 
tional Conference on Principles and Practice of Constraint Programming, 
volume 4204 of LNCS, pages 182-197. Springer- Verlag, Nantes, France, 
September 2006. 

[14] Laurent Granvillicrs and Eric Monfroy. Implementing constraint propaga- 
tion by composition of reductions. In Logic Programming: Proceedings of 
the 19 th International Conference, volume 2916 of LNCS, pages 300-314, 
Mumbai, India, 2003. Springer- Verlag. 

[15] Warwick Harvey. Personal communication, April 2004. 

[16] Warwick Harvey and Peter J. Stuckey. Improving linear constraint prop- 
agation by changing constraint representation. Constraints, 8(2): 173-207, 
2003. 

[17] ILOG S.A. ILOG Solver 5.0: Reference Manual. Century, France, August 
2000. 



43 



[18] Intelligent Systems Laboratory. SICStus Prolog user's manual, 3.11.1. 
Technical report, Swedish Institute of Computer Science, Box 1263, 164 
29 Kista, Sweden, 2004. 

[19] Frangois Laburthe. CHOCO: implementing a CP kernel. In Bcldiceanu 
et al. [3J, pages 71-85. 

[20] Olivier Lhomme, Arnaud Gotlieb, and Michel Rueher. Dynamic optimiza- 
tion of interval narrowing algorithms. Journal of Logic Programming, 37(1- 
3):165-183, 1998. 

[21] Alan K. Mackworth. Consistency in networks of relations. Artificial Intel- 
ligence, 8(1):99-118, 1977. 

[22] Kim Marriott and Peter J. Stuckey. Programming with Constraints: an 
Introduction. The MIT Press, Cambridge, MA, USA, 1998. 

[23] Roger Mohr and Thomas C. Henderson. Arc and path consistency revisited. 
Artificial Intelligence, 28:225-233, 1986. 

[24] Roger Mohr and Gerald Masini. Good old discrete relaxation. In Yves 
Kodratoff, editor, Proceedings of the 8th European Conference on Artificial 
Intelligence (ECAI 88), pages 651-656, Munich, Germany, 1988. Pitmann 
Publishing. 

[25] Matthew W. Moskewicz, Conor F. Madigan, Ying Zhao, Lintao Zhang, and 
Sharad Malik. Chaff: Engineering an efficient SAT solver. In Proceedings 
of the 38th Design Automation Conference, DAC 2001, pages 530-535, Las 
Vegas, NV, USA, 2001. ACM. 

[26] Mozart Consortium. The Mozart programming system, 1999. Available 
from www . mozart-oz . org. 

[27] Gilles Pesant. A regular language membership constraint for finite se- 
quences of variables. In Mark Wallace, editor, Tenth International Confer- 
ence on Principles and Practice of Constraint Programming, volume 3258 
of LNCS, pages 482-495. Springer- Verlag, Toronto, Canada, September 
2004. 

[28] Jean- Francois. Puget. A fast algorithm for the bound consistency of alldiff 
constraints. In Proceedings of the 15th National Conference on Artificial 
Intelligence (AAAI-98), pages 359-366, Madison, WI, USA, July 1998. 
AAAI Press/The MIT Press. 

[29] Jean-Charles Regin. A filtering algorithm for constraints of difference in 
CSPs. In Proceedings of the Twelfth National Conference on Artificial In- 
telligence, volume 1, pages 362-367, Seattle, WA, USA, 1994. AAAI Press. 

[30] Pierre Saveant. Constraint reduction at the type level. In Bcldiceanu et al. 
[3J, pages 16-29. 



44 



[31] Christian Schultc and Peter J. Stuckey. When do bounds and domain 
propagation lead to the same search space? Transactions on Programming 
Languages and Systems, 27(3):388-425, May 2005. 

[32] Pascal Van Hentenryck. The OPL Optimization Programming Language. 
The MIT Press, Cambridge, MA, USA, 1999. 

[33] Pascal Van Hentenryck, Vijay Saraswat, and Yves Deville. Constraint 
processing in cc(FD). Draft, 1991. 

[34] Pascal Van Hentenryck, Vijay Saraswat, and Yves Deville. Design, imple- 
mentation and evaluation of the constraint language cc(FD). Journal of 
Logic Programming, 37(1-3): 139-164, 1998. 

[35] Pascal Van Hentenryck, Helmut Simonis, and Mehmct Dincbas. Constraint 
satisfaction using constraint logic programming. Artificial Intelligence, 
58:113-159, 1992. 

[36] Mark Wallace, Stefano Novello, and Joachim Schimpf. Eclipse: A platform 
for constraint logic programming. Technical report, IC-Parc, Imperial Col- 
lege, London, GB, August 1997. 

[37] Neng-Fa Zhou. Programming finite-domain constraint propagators in ac- 
tion rules. Theory and Practice of Logic Programming, 6(5):483-508, 
September 2006. 



45 



